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Cn ' Abstract. We establish various results on the structure of approxi- 

^..i mate subgroups in linear groups such as SL„(fc) that were previously 

C^ ' announced by the authors. For example, generalising a result of Helf- 

gott (who handled the cases n = 2 and 3), we show that any approx- 
imate subgroup of SL„(F9) which generates the group must be either 
very small or else nearly all of SL„(F5). The argument generalises to 
other absolutely almost simple connected (and non-commutative) alge- 
braic groups G over a finite field fc. In a subsequent paper, we will give 

-^ " applications of this result to the expansion properties of Gayley graphs. 
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1. Introduction 

The purpose of this paper is to study so-called approximate subgroups 
of linear groups. In particular, we will study approximate subgroups of 
absolutely almost simple algebraic groups over an arbitrary field k, such as 
SL„(A:). (We review the definition and basic properties of absolutely almost 
simple algebraic groups in Section [5l) These results were announced in [15j; 
closely related results have also been independently announced in |50J- We 
give an application of these results here to a conjecture of Babai and Seress 
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[3]; in a subsequent paper pLTj we shall also apply these results to obtain 
expansion properties for various families of Cayley graphs. 

We begin by recalling the notion of an approximate subgroup, first in- 
troduced (in the non-abelian setting) in [56j. See [30j for a more extensive 
motivating discussion. 

Definition 1.1 (Approximate subgroups). Let K ^ 1. A nonempty finite 

set A in some ambient group G is called a i^-approximate subgroup of G if 

(i) It is a symmetric subset of G, by which we mean that if a & A then 

a~^ G A, and that the identity lies in A; 
(ii) There is a symmetric subset X C G with \X\ ^ K such that A- A CI 

X ■ A, where A- A = {aia2 : ai,a2 € A} is the product set of A with 

itself. 
We will refer to K -approximate subgroups informally as approximate groups 
when the constant K and the ambient group G are either irrelevant to the 
discussion, or are clear from context. 

Note in particular that a 1-approximate subgroup of G is the same thing 
as a finite subgroup of G. For the rest of the paper we will assume that 
K ^ 2. In this regime there are ii'-approximate subgroups which are not 
necessarily close to genuine groups, the simplest example being that of a geo- 
metric progression {(/" : \n\ ^ N}. There also exist higher-dimensional and 
nilpotent generalisations of this. Again, [30] may be consulted for further 
discussion. 

Many papers have been written in which the aim is to classify a certain 
class of approximate subgroups. For example, the Freiman-Ruzsa theo- 
rem [51] provides a description of approximate subgroups of the integers 
Z. "Classification" in this context must be interpreted quite loosely. The 
following notion of control., first introduced in [57], has proved useful in this 
context. 

Definition 1.2 (Control). Suppose that A and B are two sets in some 
ambient group, and that K ^ 1 is a parameter. We say that A is K- 
controlled by B, or that B X-controls A, if \B\ ^ K\A\ and there is some 
set X in the ambient group with \X\ ^ K and such that A C (X-B)n{B-X). 

Given this definition, one may describe the classification problem for ap- 
proximate subgroups as follows: given some class 'if of approximate sub- 
groups of a given group G, find some smaller, more highly-structured, class 
of approximate subgroups 'rf' such that every object in 'if is efficiently con- 
trolled by an object in '^'. In the next section we describe our results of 
this type in the case that the ambient group G is an almost simple algebraic 
group, such as SLd{k). 

NOTATION. The letter C always denotes an absolute constant, but different 
instances of the notation may refer to different constants. If G depends 
on some other parameter (for example, if we are working in SL„, C might 
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need to depend on n) then we will indicate this dependence with subscripts. 
If j4 is a finite set then \A\ denotes the cardinality of A. For non- negative 
quantities X,Y, we use X<YoTY>Xto denote the estimate X ^ K'-^Y, 
and X ~ y to denote the estimates X < Y < X. The symbol p always 
denotes a prime number, and ¥p denotes the field of order p. Finally, we 
use A^ := {ai . . . a„ : ai,...,a„ G ^4} to denote the n-fold product set 
of a collection A of group elements, noting that if A is a ET-approximate 
subgroup then \A^\ ^ i^"~-'^|74| for all positive integers n. 

If a, 6 € G, we write 6° := a~^ba, and more generally if B,A C G, we 
write B'' := {b" : b £ B}, 6^ := {b" : a e A}, and B^ := {b" : a £ A,b £ B}. 
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Pyber and Szabo [50] have independently announced a set of results which 
have significant overlap with those presented here, in particular establishing 
an alternate proof of Theorem 12.31 which also extends to cover all finite 
simple groups of Lie type. There are some similarities in common in the 
argument (in particular, in the reliance on Lemma l5.3p but the arguments 
and results are slightly different in other respects. 



2. Statement of results 

In a celebrated paper [32], H. Helfgott provided a satisfactory solution to 
the classification problem for the group SL2(Fp). His methods adapt easily 
to (and in fact are rather easier in) SL2(C), a setting also studied by Chang 
[20) . Helfgott's arguments give the following result. 



Theorem 2.1 (Helfgott). Suppose that A C SL2(A;) is a K-approximate 
subgroup of SL2{k). 

(i) Ifk = C, then A is K -controlled by B, an abelian K-approximate 
subgroup ofSlj2{k); 
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(ii) Ifk = ¥p, then A is K -controlled either by a solvable K -approxim- 
ate subgroup of SL2{k) or by SL2{k) itself. 

Helfgott's theorem has found many applications, for example to proving 
that certain Cayley graphs are expanders [6] and to certain nonlinear sieving 
problems [8j. For these applications, only the following somewhat weaker 
statement is necessary. 

Theorem 2.2 (Helfgott). Suppose that A C SL2(Fp) is a K -approximate 



subgroup that generates SL2(Fp). Then A is K'-^ -controlled by either {id} 
bySL2{¥p) itself. 



or 



Recently, but before our work, this result had been extended to fields ¥g 
of prime power order q = p^ hy Dinai [23]. An extension to SL„(Fp) in the 
case of small A (specifically, \A\ ^ p^+i-o for some 6 > 0) had been obtained 
by Gill and Helfgott [28j. 

The proof that such a statement suffices for the sieving work of Bourgain, 
Gamburd and Sarnak [8] is contained in a very recent preprint of Varjii 
|62j ; the original argument of [8j required a careful analysis of the proof of 
Helfgott's result. 

Our first main result generalises Theorem 12.21 as follows. 

Theorem 2.3 (Main theorem). Let k be a finite field and let G be an abso- 
lutely almost simple algebraic group defined over k. Suppose that A C G{k) 
is a K -approximate subgroup that generates G{k). Then A is i(''-'dim(G) . 
controlled by either {id} or by G{k) itself. 

Remark. When we first announced this result in [15], we restricted G 
to be an (implicitly /c-split) Chevalley group. Simultaneously with this an- 
nouncement, Pyber and Szabo [50] also announced Theorem 12. 3[ in the more 
general setting of arbitrary finite simple groups of Lie type. Subsequently, 
while writing this paper, we realised that the argument did not use the k- 
split assumption anywhere in the proof, and in fact holds in the generality 
stated above. 

Remark. Note that the constant CdimfG) does not depend on the field k. In 
particular, this theorem can be applied with G = SL^ for any fixed d, and the 
constants now only depend on d. In the case G(A;) = SL3(Fp) with p prime, 
this result was established previously in [33]. Our arguments share some 
features in common with those in [33], most notably an emphasis on upper 
and lower bounds on the intersection of A with maximal tori, and the use 
of a "pivot" argument inspired by proofs of the sum-product phenomenon. 

Theorem 12.31 will be proven in Section [5l In fact, we will prove a more 
precise result. Theorem 15. 5[ in which A does not need to generate G(A;), 
but merely has to be sufficiently Zariski dense in the sense that it is not 
contained in a proper algebraic subgroup of G of bounded complexity (we 
will make these notions more precise later). Then the conclusion is that 
A is controlled by either {id} or by the group (A) it generates. With this 



APPROXIMATE SUBGROUPS OF LINEAR GROUPS 5 

generalisation, the field k can now be taken to be an infinite field, such as 
C. 

We will deduce Theorem 12.31 from Theorem 15.51 in Section [5l 
As mentioned earlier one may use the arguments in [7] and the later paper 
|62j together with Theorem 12.31 to generalise the aforementioned results on 
expanders and on the affine sieve. A particular application is to showing 
that the Suzuki groups (a family of simple groups constructed as subgroups 
of the symplectic groups Sp4(Fq), q = 2^"+^) can be made into expanders. 
This result (established in our companion paper [16]) removes the lacuna in 
|37j . which established that the family of all non-abelian simple groups other 
than the Suzuki groups can be made into expanders. In another forthcoming 
paper |T^, we will discuss in detail the application of our results and those 
of Pyber and Szabo to expansion in groups of Lie type in general. 

By combining Theorem 12.31 with standard results from noncommutative 
product set theory (see j56j) we obtain the following alternative formulation 
of our main theorem. 

Corollary 2.4. Let d E N. Then there are e{d) > 0, C^ > such that for 
every absolutely almost simple algebraic group G with dim(G) ^ d defined 
over a finite field k, and every finite subset A in G{k) generating G{k), and 
for all < € < €{d), one of the following two statements holds: 

(i) \A\ »rf \Gik)\^-^'^'; 
(ii) l^^i ^ |^|i+.. 

Proof. If conclusion (ii) does not hold, then {A U A^^ U {id})^ is a K'-'- 
approximate subgroup of G := G(A;) of cardinality at most i^ |A| for some 
absolute constant C > and K = max(2, 1^1*^) (see [531 Corollary 3.10]). 

We are thus in a position to apply Theorem 12.31 If e is small enough 
depending on i^'^dim(G)^ [^ jg ^q^ possible for (^U^"^U{id})^ to be i^'^dim(G). 
controlled by {id} for cardinality reasons unless \A\ = Odim(G)(l); ^^ which 
case (i) or (ii) is clear (for e sufficiently small) just because the fact that A 
generates G(A;) implies that either ^ = G, or |^^| ^ |^| + 1. Thus we need 
only consider the case when A is controlled instead by G = G(A;), in which 
case conclusion (i) easily follows. D 

The fact that the above results extend to more general simple algebraic 
groups than SL„ conveys certain advantages. Over C, for example, the 
general structure theory of algebraic groups implies, roughly speaking, that 
every closed connected (in the Zariski topology) subgroup of GL„(C) admits 
a quotient by a closed normal solvable subgroup which is an almost direct 
product of simple Lie groups over C. By exploiting this theory, the gener- 
alisation of Theorem 12.31 mentioned earlier, and the main result of ll^ ■ we 
are able to establish the following result. 

Theorem 2.5 (Freiman-type theorem in GL„). Suppose that A C GL„(A;) 
is a K-approximate subgroup o/GL„(/c), where k is a field of characteristic 
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zero. Then A is On{K"^^') -controlled by B, an On{K"^^') -approximate 
group that generates a nilpotent group of nilpotency class {i.e. step) at most 
n — 1. 

This result is proven in Section [6l As a straightforward corollary we 
obtain the following special case of Gromov's theorem [31j, first established 
by combining the Tits Alternative [59] with a theorem of Milnor [45j and 
Wolf Ei 



Corollary 2.6 (Gromov's theorem for linear groups over C). Suppose that k 
is a field of characteristic zero and that G is a (finitely- generated) subgroup 
of GL ii{k) with polynomial growth. Then G is virtually nilpotent. 

We will recall the various notions mentioned here in ^ where the corol- 
lary is proved. 

In the special case where A is contained in SL2(C) or SL3(Z), Theorem 
12.51 was established by Chang [20] . The particular case of subsets of SL2 (M) 
was established earlier by Elekes and Kiraly ^24j. Note that, by the well- 
known fact that every finitely generated field of characteristic zero embeds 
in C, the proof of Theorem 12.51 is immediately reduced to the case k = C 
We are also able to say something about the structure of ii'-approximate 
subgroups of GLn{k) where A; is a finite field, at least in the case k = ¥p, 
but it is not yet clear to us what the final form of such a result will be. We 
hope to address these issues in a subsequent paper. 

Qualitative forms of the above theorems follow from the work of Hrushovski 
[35j (see in particular [35) Theorem 1.3] and [35^ Corollary 1.4]). The main 
novelty of our work lies in the polynomial dependence on the approximation 
parameter K, which is absolutely essential for applications. 

The dependence of constants on the dimension n (or on dim(G)) in the 
above theorems are in principle explicitly computable. However, if one is 
willing to sacrifice such information, a key portion of the argument (the 
proof of the Larsen-Pink inequalities, described in §3]) can be significantly 
simplified by the use of an ultrafilter argument; this is the approach we have 
chosen to take in this paper. 

We remark that a key ingredient in Helfgott's work was the sum-product 
theorem in Fj, [10] . Our approach does not use this result (and neither does 
that of Pyber and Szabo). Moreover, we may in fact reverse the implication 
and deduce the sum-product theorem from the so-called Katz-Tao lemma 
[38j and Theorem 12.31 

Theorem 2.7 (Sum-product theorem over Fp). Let p be a prime, and sup- 
pose that A is a finite subset of¥p such that |^ • A|, 1^4 + ^41 ^ -f^l^l- Then 
either \A\ ^ K'^ or \A\ ^ K^'^p. 

We give the deduction of Theorem 12.71 from Theorem 12.31 in Section [8l In 
principle one could also establish a very general form of the sum-product 
theorem, namely that every "approximate subfield" of some field k is close to 
a genuine subfield of k. Such a result is stated for instance in [58] Corollary 
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2.56], and is essentially contained in [HI [10]. To obtain such a result by 
applying our arguments here would require the classification of maximal 
subgroups of SL2(A;), which is rather more complicated than the classification 
of subgroups of SL2(Fp). Therefore we do not give any such derivation here, 
leaving the details to the interested reader. 

As remarked in the introduction, an immediate application of our results 
(noted in this context by Helfgott) is to cases of a conjecture of Babai and 
Seress on rapid generation in nonabelian finite simple groups. We state and 
prove these results in ^ 



3. Algebraic varieties 

In this section we review the definition and basic properties of algebraic 
varieties and algebraic groups. This material is needed in Sglbelow. Readers 
may wish to refer to that section now for additional motivation. 

In the classical presentation of the theory of algebraic geometry, varieties 
are defined at a qualitative level, with considerations such as the degree 
of the polynomials used to define the varieties being secondary to the main 
theory. However, due to the quantitative nature of our analysis, we will need 
to present the foundations of algebraic geometry in a similarly quantitative 
manner, in particular assigning a "complexity" to each of the algebraic va- 
rieties that we encounter. The precise definition of this complexity will not 
be important for uqj, but we will still need to fix one such definition of this 
concept in order to proceed with the rest of the proof. 

We now quickly review the foundations of quantitative algebraic geometry, 
but with the proofs of various complexity bounds deferred to Appendix El 
Our presentation here will be classical in nature, viewing algebraic varieties 
in terms of solutions of polynomials in afhne (or projective) space, rather 
than by the modern machinery of schemeqj- In particular, the basic notion 
of a complexity of a variety will depend not just on the abstract variety itself, 
but also on how we choose to embed it into an afhne or projective space. 

Throughout this discussion k will be an algebraically closed field. We 
let A'"{k) = /c" be the standard n-dimensional affine space over k, and 
P"(/c) = ik"'^^\{0}) / {k\{0}) the standard projective space. Observe that 
P"'(fc) can be covered by n+1 copies Aq (/c), . . . , A^{k), where A"(/c) is formed 
from F"'{k) by deleting the i^^ coordinate hyperplane. 

Definition 3.1 (Varieties). Let M ^ 1 be an integer. 



If however one wished (say) to effectively control the constants Cdim(G) in Theorem 12.31 
then one may wish to pay more attention to exactly how complexity is defined. 
^Of course, it is quite likely one could recast the arguments here in scheme-theoretic 
language, although one might have to take care with issues of multiplicity when perfoming 
such tasks as counting the number of points of intersection \A n V\ between a finite set A 
and a variety or scheme V. One would also require a more intrinsic notion of complexity. 
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(i) An affine variety over k of complexity at most M is a subset V C 
A" (A;) of the form, 

V = {xe A"(fc) : Pi(x) = . . . = Pra{x) = 0} 

where ^ n,m ^ M , and Pi, ... , Pm '. A"(A;) — t- k are polynomials 
of degree at most M . 
(ii) A projective variety over k of complexity at most M is a subset 
V CP"(A;) of the form 

V = {xe P"(fc) : Pi(x) = . . . = P„,{x) = 0} 

where ^ n,m ^ M , and Pi, ... , Pm '■ fe"^"*^ — )■ k are homogeneous 
polynomials of degree at most M. 
(iii) A quasiprojective variety over k {or variety for short) of complexity 
at most M is a set of the form V\W C P"(A;), where V,W are 
projective varieties of complexity at most M . 
(iv) A constructible set over k of complexity at most M is a boolean 
combination of at most M projective varieties of complexity at most 
M. 
We omit the phrase "over k " when the underlying field is clear from context. 
The Zariski closure of any subset of a constructible set V is the intersec- 
tion of V with all the projective varieties containing that set. We say that 
a set A is Zariski dense in a variety V if its Zariski closure contains V . A 
variety W is closed in another V if its Zariski closure in V is equal to W . 
A variety is irreducible if it cannot be expressed as the union of two 
proper closed varieties (of any complexity). In particular, an affine {resp. 
projective) variety is irreducible if it cannot be expressed as the union of two 
proper affine (resp. projective) varieties. 

Remarks. An affine or projective variety of complexity at most M is auto- 
matically a quasiprojective variety of complexity Om(1)) and every quasipro- 
jective variety of complexity M is a constructible set of complexity Om(1)- 
Thus we may work without loss of generality with quasiprojective varieties 
or with constructible sets. If we allow M to be arbitrarily large, then these 
quantitative notions of a variety coincide with the usual ones in classical 
algebraic geometry. 

The intersection or union of two affine varieties V,W C A" (A;) of complex- 
ity at most M will be another affine variety of complexity Om(1)- Similarly, 
the Cartesian product of two affine varieties V, W of complexity at most M 
be an affine variety of complexity Om(1)- Analogous statements hold for 
projective and quasiprojective varieties (using the Segre embedding to model 
Cartesian products), and for constructible sets, with the exception that the 
union of quasiprojective varieties is merely a constructible set rather than a 
variety in general. 

Our notion of complexity bounds the ambient dimension n, the maximum 
degree d of the defining polynomials, and the number m of such polynomials. 
As it turns out, the third bound is essentially redundant, as it is controlled 
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by the first two. Indeed, the space of polynomials of degree at most d that 
vanish on V is a linear subspace of the space of all polynomials of degree at 
most d on the afhne space A'^(k), and thus has dimension Od,n{^)- 

We now give a useful result, that asserts that the Zariski closure operation 
is well-behaved with respect to complexity. 

Lemma 3.2 (Zariski closure preserves bounded complexity). Let V d W 
he constructihle sets of complexity at most M. Then the Zariski closure of 
V in W has complexity at most Om(1)- 

Proof. See Appendix E □ 

Next, we recall the notion of a regular map between varieties, but with a 
notion of complexity attached. 

Definition 3.3 (Regular map). Let V C P"(A;) and W C P™(A;) he varieties, 
and let M ^ 1. A map f : V —?■ W is said to he regular with complexity 
at most M if V,W are individually of complexity at most M , and if one 
can cover V hy some varieties Vi, . . . ,Vr of complexity at most M for some 
r ^ M such that 

(i) for each 1 ^ j ^ r, f{Vj) is contained in an affine space Af^^k) C 
P™(/c); 

(ii) the map /|v^. has the form {Pj,i/Qj,i, ■ ■ ■ , Pj,m/Qj,m), where the 
Pj,hQj,l '^'^6 homogeneous polynomial maps from /c"+^ to k with 
deg{Pj^i) = deg{Qj^i) ^ M, and the Qj^i are non-vanishing on Vj. 

A regular map (p : V ^ W is dominant if V is irreducible and (piV) is 
Zariski- dense in W . 

Again, if we allow M to be arbitrarily large, then this notion of a regular 
map coincides with the usual one in classical algebraic geometry. 

The class of all varieties and the regular maps between them form a 
category. This category is well-behaved with respect to complexity: 

Lemma 3.4 (Composition). The composition of two regular maps of com- 
plexity at most M is of complexity Om(1)- In particular, the restriction of a 
regular map of complexity at most M to a subvariety of complexity at most 
M is a regular map of complexity at most Oj\/(l). 

The image (or preimage) of a variety V of complexity at most M hy a 
regular map of complexity at most M is a constructihle set of complexity 
Oa/(1). In particular, by Lemma\KM the Zariski closure of this image is a 
variety of complexity Om{^)- 

Proof. See Appendix |Al D 

Informally, the first part of Lemma 13.41 asserts that the class of bounded 
complexity varieties and bounded complexity regular maps also form a cate- 
gory 
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To each non-empty (closed) affine variety V C A"(A;), we can assign a 
dimension dim(y), which is an integer between and n, and can be defined 
as the maximal length d of a proper nested sequence C Vq CI . . . C V^ C F 
of irreducible (closed) affine varieties in V. The dimension of a projective 
variety is defined similarly. The dimension of a quasiprojective variety or 
constructible set is then defined as the dimension of its Zariski closure. We 
adopt the convention that the empty set has dimension —1. 

We recall the following basic estimate. 

Lemma 3.5 (Weak Bezout). Let V be a 0-dimensional constructible set of 
complexity at most M. Then V is finite with cardinality Om(1)- 

Proof. See Appendix El □ 

Recall that a property holds for generic points in an variet}0 if it holds 
at all points outside of a variety of strictly smaller dimension. We introduce 
a quantitative notion of this concept: 

Definition 3.6 (Generic points). If M ^ 1, and V is a variety of complexity 
at most M , we say that a property P{x) holds for M-generic points x ^V 
if it holds for all points in V outside of a subvariety U of V of complexity 
M and dimension strictly less than that of V. The set V\U will be called 
an M-Zariski open subset ofV. 

If (/) : V ^ W is a regular map, then it is well-known that (piV) has 
dimension less than or equal to that of V. Furthermore, the fibres 4i~^{x) 
generically have dimension dim(Ty) — dim(y). These facts behave well with 
respect to complexity, as the following lemma shows. 

Lemma 3.7. Let V, W be varieties of complexity at most M , and let <j) : 
V ^ W be a regular map of complexity at most M. Then there exists an 
OMi^)-Zariski open subset V' ofV, and a subvariety W' ofW of dimension 
at most dim(y), with the following two properties: 

(i) (Generic mapping) (piV) C W. 

(ii) (Generic fibres) For any w E W , the set {v ^ V' : (/>(«) = w} 
is a constructible set of complexity Om(1) o-iT'd dimension at most 
dim{V)-d[m{W'). 
We refer to W' as an essential range of (j). 

Finally, if (p : V —^ W is a dominant map, then we may take W' to be a 
OMi^)-Zariski open subset ofW. 

Proof. See Appendix El D 

Remark. If V is irreducible, then we can automatically make (j) dominant 
by replacing W with the Zariski closure of (j){V) (using Lemma 13.41 to keep 



With this convention, if the variety is reducible, then any lower-dimensional components 
of the variety will not be considered generic. Note that this is distinct from other conven- 
tional definitions of genericity in the literature. 
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the complexity bounded). On the other hand, if V is not irreducible, then 
the (Zariski closure of the) essential range need not be unique. Indeed, 
consider the example V = ({0} x k) U {(1,0)} C A^(A;) with the projection 
map (j) : A'^{k) — >■ A^{k) defined by (/){x,y) := (j){x). Then both {0} and 
{0, 1} would qualify as essential ranges. 

Lemma 3.8 (Slicing lemma). Let V,W be varieties of complexity at most 
M , and let S he a subvariety ofVxW of com,plexity at m,ost M and dimen- 
sion strictly less than dimiV) + dim(VF). Then for Om{^) -generic v G V, 
the set {w & W : {v,w) € S} is a constructible set of complexity Om(1) and 
dimension strictly less than diin(W). 

Proof. See Appendix S Q 

Algebraic groups. We now recall the definition of an algebraic group, 
again with a notion of complexity attached to it. 

Definition 3.9 (Algebraic groups). Let M ^ 1. An algebraic group G 
of complexity at most M over an algebraically closed field k is a variety G 
of complexity at most M which is also a group, with the group operations 
■ : GxG ^f G and ()~^ : G ^^ G given by regular maps of complexity at most 
M . If G is an irreducible variety, we say that G is a connected algebraic 
group. 

Remark. Recall that we are taking a classical algebraic geometry view- 
point here, so that our algebraic groups G are not abstract, but instead 
come equipped with an embedding into affine or projective space. This is 
necessary in order to make the notion of complexity well-defined. 

Remark. Our main theorem Theorem 12.31 deals with linear algebraic 
groups, that is algebraic groups whose underlying algebraic variety is affine. 
Nevertheless a fair amount of what we do in this paper, all of §4 and Appen- 
dix!^ and in particular the key Larsen-Pink inequality, hold for arbitrary 
algebraic groups including for instance abelian varieties. 

Remark. If a connected linear algebraic group is nilpotent or semisimple, 
and the underlying field has characteristic zero, then it is possible to show 
that after a change of variables (which may have unbounded complexity), 
one can find an isomorphic copy of this group whose complexity is bounded 
in terms of its dimension only. However this is not true in general, and in 
particular fails for solvable groups over C. To see this, consider, for each 
fc G N, the subgroup Gk of GL3(C) defined by 



Gfc := < I x" t \ ■.xeC'',t€ 




One easily checks that Gk is a connected solvable algebraic group of dimen- 
sion 2 whose center is isomorphic to the group of /c-roots of unity. Lemma 
13. 101 below shows that bounded complexity algebraic groups have a bounded 
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complexity center. Thus the complexity of Gk is not bounded as k grows. 
As this argument was purely algebraic, it also shows that any isomorphic 
copy of Gk must also have complexity that is unbounded in k. 

In the characteristic zero case, it turns out that such solvable examples, 
consisting entirely of upper-triangular examples, are essentially the only way 
in which unbounded complexity of an algebraic subgroup can occur; we will 
formalise this observation (which greatly simplifies the proof of Theorem 
[23]) in Section El 

Given two varieties V,W m. G of complexity at most M, the product set 
y • VF is the image of ^ x P^ under the regular product map ■ : G x G —> 
G, and is thus a constructible set of complexity Om(1) by Lemma 13.41 
Furthermore, from Lemma 13.71 we can construct an essential range of the 
product map ■ : V x W ^- G; we call such a range an essential product of V 
and W. 

If y is a subvariety of G, we observe from the invertibility of the conju- 
gation map g i— t- a~^ga in the category of regular maps that V"' := {a~^ga : 
g € V} is also a subvariety of G of the same dimension. From Lemma 13.41 
we see that if V, G have complexity at most M, then V"" has complexity 
Om(1)- In a similar spirit, we have the following. 

Lemma 3.10. If G is an algebraic group of complexity at most M , then for 
each a & G, the conjugacy class 

a^ := {g-'ag : g € G} 

and the centraliser 

Z{a) ■= {g e G : ga = ag} 

are constructible sets of complexity Oa/(1). Similarly, if H is an algebraic 
subgroup of G of complexity at most M , then the normaliser 

N{H) := {<? G G : g-^Hg = H} 

and centraliser 

Z{H) ■={g(£G :gh = hg for all h e H} 

are algebraic subgroups of complexity at most 0^/(1). 

Proof. See Appendix El D 

It is a standard fact that the Zariski closure of a group is an algebraic 
group. The following lemma records a quantitative variant of this fact. It 
has been used in the past in several places in the literature, in particular in 
the work of Eskin-Mozes-Oh [26] on uniform exponential growth for linear 
groups (see also [l3]). It was then put to use in additive combinatorics by 
Helfgott in [33] who called it "escape from subvarieties" . 



Lemma 3.11 (Escape from subvarieties). For every M there exists an inte- 
ger m ^ 1 such that the following statement holds: for every algebraic group 
G of complexity at most M, every subvariety V of G of complexity at most 
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M , and every symmetric subset A of G containing id such that A^ C V, 
we have A C H for some algebraic subgroup H of G contained in V of 
complexity Oa/(1). 

Proof. See Appendix |S1 CH 

The (almost) simplicity assumption in our main theorem, Theorem 12.3^ 
will be used in a key way via the following lemma. 

Lemma 3.12 (Product-conjugation phenomenon for varieties). Let G be 
an algebraic group of complexity at most M for some M ^ 1. Let V,W be 
algebraic varieties in G of complexity at most M such that 

< dim(y),dim(iy) < dim(G). 

Then at least one of the following holds: 

(i) (G is not sufficiently almost simple) G contains a proper normal 

algebraic subgroup H of complexity Ojv/(l) CLnd positive dimension. 

(ii) (Product- conjugation phenomenon) For Om{^)- generic a G G, there 

exists an essential product [V"" ■ W)^^^ of V"" := a~^Va and W of 

dimension strictly greater than dim(P^). 

Proof. See Appendix |Al D 

In practice, Lemma 13.121 only becomes useful when G is almost simple 
(and in particular, when G is connected). 

We use the terminology "product-conjugation phenomenon" here in anal- 
ogy with the "sum-product phenomenon" in fields. The latter phenomenon 
asserts, roughly speaking, that the only way for a finite subset of a field to be 
approximately closed under both addition and multiplication is if it is essen- 
tially a subfield of the original field. Similarly, the product-conjugation phe- 
nomenon refers to the heuristic that the only (algebraic) sets in an algebraic 
group that are approximately closed under both products and conjugation 
are normal algebraic subgroups. This tension between the product opera- 
tion and the conjugation operation is the key to our methods for controlling 
approximate groups, particularly in Lemma 15.31 iii which the behaviour of 
the intersection of an approximate subgroup with a maximal torus is studied 
with respect to conjugation of that torus. 

4. The Larsen-Pink inequality for approximate subgroups 

Suppose that G is a simple algebraic group over some field and that A C G 
is a X-approximate subgroup of G which is "sufficiently dense" in G. The 
Larsen-Pink inequality asserts, roughly speaking, that one has a bound of 
shape \Ar\V\ < ^0(i)|^|dim\//dimG £qj. g^^j bounded complexity subvarieties 
V ^ G. Here is a precise statement of this fact, which is fundamental to our 
work. 
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Theorem 4.1 (Larsen-Pink inequality for symmetric subsets). Let M ^ 1, 
let k be an algebraically closed field, and let G be an algebraic group of over k 
complexity at most M and positive dimension. Let A be a symmetric subset 
of G. Then at least one of the following statements hold: 

(i) (G is not sufficiently almost simple) G contains a proper normal 

algebraic subgroup H of complexity Oj\/(l) and positive dimension; 

(ii) {A is not sufficiently Zariski dense) A is contained in a subvariety 

of G of complexity Om(1) cind dimension strictly less than G; 
(iii) For every subvariety V of G of complexity at most M , one has 

for some C = Om(1)- 

Remarks. As the name suggests, the proof of this inequahty wiU fohow 
the arguments of Larsen and Pink [32] , who treated the case when A was a 
genuine finite subgroup; but it turns out that the arguments extend without 
much difficulty to arbitrary symmetric subsets. The idea that the techniques 
of Larsen and Pink might be useful to us came to us through an analogous 
application of these ideas by Hrushovski |35] (see also [36] ) . It is also worth 
remarking that several special cases of Theorem 14.11 are established in Helf- 
gott's work ^33j (for example in certain cases where ^ is a maximal torus). 
The Larsen-Pink and Helfgott arguments are fundamentally rather similar, 
although this has only become clear rather recently. A very similar inequal- 
ity has also been recently established by Pyber and Szab6[49]. 

Remark. In this section we work with an arbitrary algebraic group G. In 
particular G is not assumed to be linear and may for instance be an abelian 
variety. 

We have stated the theorem for symmetric subsets A in general, but will 
apply it when A is an approximate subgroup (see Corollary 14.31 below). 

Proof of Larsen-Pink. The basic idea of the proof of Theorem 14.11 
is not especially difficult to describqj, but the details require some effort. 
Suppose that V^,V^ is any pair of subvarieties of G with dim(y^) < 
dim(y+). Then we bound l^dnV"! and l^dny+l in terms of [Td'^^^^ny^l and 
I^C(i) Pi y+l^ where the pair V~ , V^ is somehow "easier" to deal with than 
V^ ,V^ . By iterating this replacement algorithm or variants of it, we can 
reduce to the fairly trivial task of bounding lA*^*-^) n V\ when dim(y) = 
or dim(G). 

The idea behind the construction of V~ and V^ is not especially difficult 
to describe either. If G is not sufficiently almost-simple then (i) holds. 
Otherwise, Lemma 13.121 applies and it roughly states that a generic g & G 
has the property that the dimension of the product {V~)^ ■ V~^ is strictly 



Our argument here is inspired by the presentation of the Larsen-Pink inequahty in |35l 
Proposition 5.5], though the notation and language used there is rather different from that 
given here. 
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greater than dim(y"''). If no such g hes in A then A cannot be sufficiently 
Zariski-dense, in which case (ii) holds. If there is some g (z A with this 
property then set V~^ := {V~)^ ■ V^ and take V~ to be a suitable fibre of 
the product map 

{vy X y+ ^ y+. 

Generically we will have dim(y^) < dim(y^) and dim(y^) > dim(y~*"), 
and this qualifies the pair V~ , V~^ as simpler than V~ , V~^ (in that this pair 
of varieties lies closer to a pair of varieties for which we can apply the trivial 
bound). Furthermore it is clear than we can hope to bound \A n V~\ and 
l^ny+l in terms of |Any~| and |^ny+|, as stated. 

In reality the argument is slightly more complicated, this being due to 
our rather cavalier use of the word "generically". Furthermore the actual 
details of the iterative argument are tricker to handle than one might hope. 

We begin by formulating a more precise lemma encapsulating the above 
observations. 

Lemma 4.2 (Inductive step). Suppose that A is a symmetric subset of 
G and that V~,V~^ are subvarieties of G of complexity at most M with 
dimensions d~,d^ satisfying < d~ ^ d~^ < dim(G). Then one of the 
following three alternatives holds: 

(i) {G is not sufficiently almost simple) G contains a proper normal 

algebraic subgroup H of complexity Om(1) CLnd positive dimension; 

(ii) (A is not sufficiently Zariski dense) A is contained in a subvariety 

of G of complexity Om(1) and dimension strictly less than G; 
(iii) There are subvarieties V^ and V+ of complexity Om(1) such that 

\Ar\v-\\Ar\v+\ ^ 4|A^ny-p^ny+|. (4.2) 

and whose dimensions d^,d'^ satisfy either 

(1) d" < d^, (i+ > (i+ and J" + d+ = d" + (i+ or 

(2) d- +d+ <d- + d+. 

We will prove this lemma below, but first we show how it implies Theorem 
14.11 The beef of such a deduction lies in a proper handling of the inequalities 
in (iii). 

Proof of Theorem \4-l\ given Lemma \4.S\ Set D := dim(G). If dim(y) = 
then the result follows from Bezout's theorem (Lemma 13. Sh whilst if 
dim(y) = dim(G) then the result is trivial. We refer to these cases as 
the endpoint bounds. Suppose, then, that < dim(y) < dim(G). We 
shall inductively define pairs {Vj^ ,V^) of varieties of complexities at most 
Mi = Oj,m(1)i « = 0, 1, 2, . . . , initialising so that Vq", V^"^ := V . Suppose we 
are at the ith stage of the iteration: if the algorithm has not already stopped 
then the dimensions d^ = dim(V^ ) will satisfy < d- ^ d^- < dim(G). 
This means that it is valid to apply Lemma 14.21 and we do this with A^ in 
place of A and F~ := F^^, V^ := V^^ . If case (i) or (ii) occurs then, pro- 
vided the algorithm has only run for Oa/,_d(1) steps, we obtain conclusions 
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(i) and (ii) of Theorem 14.11 respectively. In the more interesting case that 
(iii) occurs then we obtain varieties V~ and V~^ such that (j4.2p holds and 
such that the dimensions d~,d~^ satisfy (1) or (2) of Lemma l4.2i We then 
distinguish several cases, as follows. 

Case 0. If J" = and d"*" = dim(G) then stop; 

= y+; 



Case 1. 


ud- 


" = then set V^,^ = 


vr and V^X, 


= y+ 


Case 2. 


If d+ = dim(G) then set V-l^ = 


: y+ and Vr^, 


Case 3. 


IfO 


< J- ^ d+ < dim(G) 


and 










|^4-+ln^+|l/J+ 


^\A^ 


'^' n V- 


-|i/j- 


then set V 


j+i = 


: vr and V+^ = V+; 








Case 4- 


IfO 


<d- i^d+ < dim(G) 


and 










|^4-+ln^+|l/J+ 


^1^^' 


^'nv~ 


ji/d- 

1 5 


then set V 




■Vt and Vr^, = Vr. 









By induction on i one may check the lower bound 

\A^^ n y±| >i,M 1^ n y|^f /dimi/^ (4.3) 

at each step. To see this in cases 1 and 2, we use (j4.2p (for A^ ) and the 
endpoint bounds. To see this in cases 3 and 4, we observe from (|4.3p and 
the inequality d~^ + d~ ^ d^ + d~ that 

maxd^^""-' n y+|V'^"^|^^"^' n yrr^'") 

» maxd^l^'^' n V+\^/'^^, |A^' n V-\^/'^~) (4.4) 

from which the claim ()4.3p follows. 

If the algorithm we have described terminates in time Od^I), then Theo- 
rem ST] now follows. Indeed upon termination we have dim(T^'^) = dim(G), 
and so the theorem is indeed an immediate consequence of ()4.3p . 

It is not immediately clear that the algorithm does terminate, however. 
To analyse this situation, note that by a similar inductive argument we may 
in fact establish the stronger bound 

\A^'' n Vr^\ »,,M \A n y|(l+'=l.)'"'^,*/dimy (4_5) 

for some constant c/j > 0, where m is the number of invocations of alterna- 
tive (2) of Lemma |4.2[ Indeed, when (2) occurs thwn we have d~^ + d'^ < 
d~^ + d~ , which allows us to raise the right-hand side of (j4.4p by 1 + c/j for 
CD small enough, and the claim (j4.5p then follows by repeating the proof of 

(S^D- 

If m > 21ogD/log(l-|-C£)) then ()4.5p already implies Theorem 14. 1 1 simply 

by applying the trivial bound |^'^ n V^ | ^ |A^ | to the left-hand side of 
(j4.5p . Suppose, then, that there are just Od(1) invocations of alternative 
(2) of Lemma 14.21 Observe that the algorithm cannot run for more than D 
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steps using only invocations of (1), since at any such invocation we have the 
inequahty 

dt+i-d^+i>dt-d-. 

It follows that in this case the algorithm does indeed terminate in time 
Od{1), and so the proof of Theorem 14.11 follows by our earlier remarks. D 

Remark. A more careful inspection of the above argument shows that 
the constant cd can be bounded below effectively by ^ D~^. This allows 
one to control the number of iterations here effectively, and this in turn 
means that in the bound (7-|^|^C'2|dimy/dimG -^^ Theorem 14.11 one can take 
the exponent C2 to be an effective quantity of the form exp(0((dim G) '^^)). 
Our arguments do not make the multiplicative factor Ci effective, however, 
due to our reliance on ultrafilters. By laboriously replacing all of those 
arguments by effective algebraic geometry lemmas it ought to be possible to 
furnish an explicit dependence of Ci on dim(G) and the complexity bound 
M, but this would be a considerable amount of work, the bounds would 
likely be bad and we do not at present have any applications for such a 
result. 

It remains, of course, to establish Lemma [4. 2 [ Suppose then that we have 
subvarieties V^jV'^ as in the statement of that lemma and that neither 
(i) nor (ii) of the lemma holds. That is to say, G has no proper normal 
subgroups H of complexity Om(1) and positive dimension, and A is "suffi- 
ciently Zariski-dense" in the sense that it is not contained in any subvariety 
of complexity Ojv/(l) and dimension strictly less than dim(G). We are free 
to choose these unspecified constants Oa/ (1) as we please during the proof. 

By Lemma [3.121 we see that for a Oa/ (l)-generic set of points a € G, 
there exists an essential product (V"_" • V+)'''^'^ of V^ and V+ of dimension 
strictly greater than d'_^_. Since A is sufficiently Zariski-dense, at least one of 
these points lies in A. Henceforth we fix such an a such that the essential 
product W ■.= {V^- y+)'="" of V^ and V+ of dimension > d'_^. 

By definition of essential product, W has complexity Om{^), and there 
exists a subvariety S of V- x V+ of dimension < d^ + d^ and complexity 
Oa/(1) such that 

(v")"w+ G W 

for all {v~,v^) G {V^ x y+)\S', and such that for any w G W, the set 

{{v-,v+) eV- xV+ : (?7~)"t7+ = w} (4.6) 

is contained in a variety of complexity Oa/(1) and dimension at most 

dim(y- X V+) - dim(Ty) < d'_. 

We distinguish two cases: 

Case 1. \{AxA)nS\^l\AnV-\\AnV+\; 
Case 2. \{A x A) n S\ ^ ^\AnV-\\AnV+\. 
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These will correspond to options (1) and (2) of alternative (iii) of Lemma 
14.21 respectively, as we shall now see. 

Suppose first that we are in Case 1. For each w G W, set 

F^ := {v- eAnV : (i;-)"v+ = w}, 

By simple double-counting we have 

-\Anv-\\Anv+\ ^ |(Ax A)n5| ^ ^ \F^\ ^ \F^g\\A'^nw\ 

wew 

where wq £ W maximises the cardinality of |-F«)ol- Applyi'^S Lemma 13.41 to 
(j4.6|) . we see that F^j^ is contained in a subvariety V~ of G with dimension 
< d'_ and complexity Om{^)', it is also clearly contained in A'^. Setting 
Y+ ;= W, we obtain conclusion (1) of Lemma 14.21 (iii) in this case. 

Now suppose alternatively that we are in Case 2. By Lemma 13.81 we can 
find a subvariety V'~ of V~ of complexity Oj\/(l) and dimension < d" such 
that for every v~ G V~\V'~ the fibre F^- := {v^ S V~^ : (t;~,w+) € S} is 
contained in a variety of dimension < d^ and complexity Om(1)- Write 

So:={{v-,v+) eS:v- eV'-}. 

We divide into two further subcases: 

Case 2a. \{A x ^) n S^ol ^ i|(^ x .4) n S\] 

Case 2b. |(^ x yl) n (5 \ So)| ^ ^\{AxA)nS\. 

Recall that both are subcases of Case 2, which means that | (j4 x yl) n S"! ^ 

^\Ar\V-\\Anv+\. 

In Case 2a, simply take V~ := V'^ and V^ := V^ and note that 

|(Ax A)n5ol ^ \Ar\V'~\\Ar\V^\ = \Ar\V-\\Ar\V+\. 

This verifies alternative (2) of Lemma 14.21 (iii) in this case. 

In Case 2b, take V~ := V~~ and take V'^ to be that amongst the fibres 
F^-, v~ G V~\y'~ ^ having largest intersection with A. Then 

|(Axyi)n5K2|(^xA)n(5\S'o)| ^ ^ |F„-nAK |Any~||Any+|, 

confirming alternative (2) of Lemma 14.21 (iii) in this case. 

All eventualities having been covered, the proof of Lemma [4.2l (and hence 
that of Theorem 14. ip is complete. D 

If ^ is a /^-approximate subgroup of a group G, then clearly we have 
\A!^\ ^ i^'"~^|A| for any m G N. As a consequence of Lemma 13.111 and 
Theorem 14.11 (applied to A™" for some large integer ?tio)i we obtain the 
following corollary. 

Corollary 4.3 (Larsen-Pink for approximate subgroups). Let K,mQ,M ^ 
1, let k be an algebraically closed field, and let G be an algebraic group of 
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over k of complexity at most M and positive dim,ension. Let A be a K- 
approxim,ate subgroup of G. Then at least one of the following statements 
hold. 

(i) (G is not sufficiently almost simple) G contains a proper normal 

algebraic subgroup of complexity OM,mo{^) and positive dimension; 
(ii) {A is not sufficiently Zariski dense) A is contained in an algebraic 

subgroup of G of complexity OM,mo(l) o.nd dimension strictly less 

than G; 
(iii) For every 1 ^ m ^ ttiq and every subvariety V of G of complexity 

at most M , one has 

We note the following important consequence of this corollary. 

Lemma 4.4 (Centralisers are rich). Let G be an algebraic group of com- 
plexity at most M , let A be a K -approximate subgroup in G, and let a & A. 
Then at least one of the following statements holds: 

(i) (G is not sufficiently almost simple) G contains a proper normal 

algebraic subgroup H of complexity Om(1) and positive dim,ension; 

(ii) [A is not sufficiently Zariski dense) A is contained in an algebraic 

subgroup of G of complexity 0^/(1) and dimension strictly less than 

G; 

(iii) For every a (z A, one has 

\A^ n Z{a) I ~A/ !^|di"^(^('^))/ di'^(^) , (4.8) 

in the sense that the left and right sides are equal up to multiplica- 
tion by a quantity of the form Om{K^'^'^^'). 

Proof. The upper bound implicit in (j4.8p comes from Corollary 14.31 and 
Lemma [3.10| so we focus on the lower bound. We allow all implied constants 
to depend on M. 

The fibres of the regular map (j) : G ^ G defined by (j){g) := g~^ag are 
all cosets of Z{a), and thus have dimension dim(Z(a)). As a consequence, 
(/)(G) = a has dimension dim(G) — dim(Z(a)). By Lemma [3. 4( this variety 
has complexity 0(1). Applying Corollary 14. 3( we thus have either (i) or (ii), 
or else 

\(I){A)\ < _?^0(l)|^jl-dim(Z(a))/dim(G)_ 

By the pigeonhole principle, there thus exists b G a^ such that 
\{g£A: g-^ag = 6}| > K-0(i)|A[dim(z(a))/dim(G)_ 

But if g, h lie in the above set, then hg^^ lies in A^ n Z{a). This gives the 
desired lower bound for (14.811. D 



Remarks. We remark on the connection between our proof of the Larsen- 
Pink inequalities and Helfgott's work in [SSj. If A is an approximate sub- 
group of G = SL„(Fp), Helfgott obtains a bound of the form |^ n T| ^ 
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/^0(i)|^|i/(n+i)^ where T is any maximal torus in G; this is a special case of 

the upper bound in (j4.8p . He does this by examining products T"i 7"«n+i 

of conjugates of T, showing that for tuples (ai, . . . , a„+i) in a Zariski-dense 
set the fibres of the map 

(il, . . . , tn+l) l-> t"^ . . . t„!|_Y 

have size 0„(1). In a sense, this argument is precisely the same as our 
argument above in this special case, except that the necessary algebraic 
geometry is done in a completely explicit fashion (which has the advantage 
that effective bounds are produced). It is very helpful in the arguments in 
[33j that dim T exactly divides dim G, but this is probably not essential. We 
remark that Helfgott also obtains lower bounds for the intersection of certain 
maximal tori with A, but not for all tori containing a regular semisimple 
element of A. 

5. The case of almost simple algebraic groups 

We turn now to the main business of the paper and, in particular, the 
proof of Theorem 12. 3i We begin by reviewing some standard terminology 
concerning almost simple algebraic groups. In this section we deal with 
linear algebraic groups only. 

Let k be an arbitrary field and k an algebraic closure of k. By definition, 
an absolutely almost simple k-algebraic group G is a non-abelian connected 
linear algebraic group defined over k with no non-trivial proper normal con- 
nected algebraic subgroup defined over k; in particular, G has no proper 
normal algebraic subgroups of positive dimension. When k is algebraically 
closed we simply talk about almost simple algebraic groups. We denote by 
G = G{k) its group of /c-points. 

From now on, unless otherwise stated, we will assume that k is alge- 
braically closed because our main result, i.e. Theorem 15.51 is formulated 
in this setting. We will consider non-algebraically closed fields only when 
applying Theorem 15.51 to the finite field case in order to prove Theorem 12.31 
from the Introduction. 

When k is algebraically closed (as we now assume), almost simple alge- 
braic groups have a well-known structure and are parametrised by a pair 
(g. A), where 3 is a finite dimensional simple complex Lie algebra (given in 
terms of its defining root system A) and where A is a certain free abelian 
group to be chosen among a finite collection of such. We refer the reader 
to Humphreys' book [34| for a pleasant and thorough introduction to this 
material. 

Finite dimensional simple complex Lie algebras are parametrised by (re- 
duced, irreducible) root systems. According to the Cartan-Killing classifica- 
tion, these root systems fall into 4 infinite families {An)n^i, {Bn)n^2-, iCn)n^3 
and {Dn)n^4 or are one of the 5 exceptional ones Eq, Ej, Eg, F4 and 
G2- The infinite families correspond to the so-called classical Lie algebras 

stn+i,S02n+i,Sp2n and SO^n- 
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To every (abstract, reduced, irreducible) root system A are attached two 
free abelian groups of the same common finite rank, the lattice of roots A,- 
and the lattice of weights A^ which contains it. Their rank rk is called the 
rank of the root system, or of q. It can be seen that the group A^j/A^ has 
cardinality bounded by rk+1. 

To every pair (g. A) one may associate in a unique fashion an almost 
simple algebraic group G defined over k associated to (fl, A). Its center can 
be identified with IIom(A/A.r, k*), hence is of size at most rk(G) + l (equality 
being realized for instance when G = SL„ and k = C). When A = A^., G 
is said to be adjoint, while when A = A^, G is said to be simply connected. 
Note that adjoint groups G{k) are center-free, and hence abstractly simple. 
For the explicit construction of G from (fl. A), see j34j . 

It follows in particular from this construction that dimG = dimg, which 
in turn implies that there are only finitely many isomorphism classes of 
almost simple algebraic groups over k whose dimension is given, because 
the same holds for complex Lie algebras as a result of the Cartan-Killing 
classification. 

Furthermore, the complex Lie algebra q admits a free abelian subgroup 
of full rank, denoted by qz, such that qz is stable under braket, and Qz'i^ k 
coincides with the Lie algebra Qk of G{k) defined as the Zariski tangent 
space at the identity of G(A;). The group G(A;) acts naturally on g^ via the 
adjoint representation. 

Example. If g is a complex simple Lie algebra of type An, then q is 
isomorphic to s[„,+i(C), the Lie algebra of traceless complex square matrices 
of size n+1. Let f) be the subalgebra of diagonal matrices. The root system A 
of consists of "'2 roots of g defined as the linear forms (A,— Aj)j^j, where 
Aj G f)* is the i-th diagonal coefficient. The lattice A,, is the subgroup of f)* 
generated by the (Aj — Xj)i^j, while the lattice of weights A^ is the subgroup 
of f)* generated by the (Aj)j's. Here A^/A^ ~ Z/(n + 1)Z and rk = n. When 
A = A^, then the group associated to (g. A) and an algebraically closed field 
k is exactly the classical group SL„+i(A;) of determinant 1 matrices over the 
field k. When A = A^ the corresponding group is PGL„+i(A;) of projective 
linear transformations of fc"""*"^. 

A torus S" in G is a connected diagonalisable algebraic subgroup of G. 
By diagonalisable, we mean that under some faithful algebraic embedding 
of G in GL„ (recall that every linear algebraic group admits a faithful al- 
gebraic finite dimensional representation), S* is a subgroup of the diagonal 
matrices for some choice of basis. This notion is well-defined (see the books 
of Humphreys [34] or Borel [4]). A maximal torus is a torus of maximal 
dimension in G. It can be shown that they are all conjugate in G{k). 

Given a maximal torus T of G, the group of characters X{T) := Hom(T, k*) 
can be identified with the lattice subgroup A. Roots q € A thus give rise to 
characters. 

The centraliser of T coincides with T, while T has finite index in its 
normaliser N(T). The quotient group N(T)/T is the so-called Weyl group 
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of G. It permutes the roots of G and can be identified with the abstract 
Weyl group of the root system A. 

The subgroups Tq = kera are the so-called maximal singular tori; they 
are subtori of T. An element g in G{k) is said to be semisimple if it is con- 
tained in some torus. A semisimple element of T is called regular semisimple 
if it is not contained in any of the T^, a G A. Regular semisimple elements 
are precisely those elements g of G(A;) such that the multiplicity of the eigen- 
value 1 in the matrix representation Ad{g) on Qk is minimal. This is clearly a 
Zariski-open condition. The centraliser Z{a) of a regular semisimple element 
a has minimal possible dimension, namely the rank rk G. Its connected com- 
ponenljj of the identity is the unique maximal torus T = Z{a)o containing 
a. In particular Z{a) ^ A^(r) and \Z{a)/Z_{a)o\ ^ |A^(r)/r|. 

If k is not algebraically closed, then G{k) is of the type described above, 
but there are in general several isomorphism classes of A;-algebraic groups 
which are nonetheless isomorphic over k. These are called the A;-forms of 
G. When k is finite, they have been entirely classified by Steinberg [55] and 

Tits [miEi]. 

For the above and more background on almost simple algebraic groups 
and Chevalley groups, we refer the reader to the books by Humphreys [34] 
and Borel [3] and to Steinberg's lecture notes |54] . We record some of the 
above facts in the form of a lemma. 

Lemma 5.1. Let M ^ 1, and let G be an almost simple^ algebraic group 
over an algebraically closed field k of complexity bounded by M . 

(i) Regular semisimple elements of G = G{k) form a Zariski open 

subset of G whose complement is of complexity at most Om{1). 
(ii) Each regular semisimple element a is contained in precisely one 
maximal torus, which is the connected component of the centraliser 
Z{a) of a. 
(iii) All maximal tori are conjugate, and their common dimension is the 

rank rk(G). We have rk(G) < dim(G). 
(iv) Given a maximal torus T, the Weyl group N(T)/T has cardinality 
Oa/(1). 

Proof. Most of these results are standard, except perhaps for the complexity 
bound in (i). This follows from the fact that the adjoint representation 
of G{k) on 0fc has complexity Om(1) and the condition that Ad{g) has 
the maximum number of distinct eigenvalues is clearly given by the non- 
vanishing of polynomials of degree Odim(g)(l) iii the entries of Ad{g). D 

Example. If G = SL„, then the semisimple elements are the diagonalisable 
matrices in G{h), and the regular semisimple elements are those matrices 



We thank J.-P. Serre for pointing out to us that Z{a) may not be connected if G is not 
simply connected. 

Recall that we require almost simple groups to be non-abelian, thus excluding the de- 
generate one-dimensional case. 
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whose eigenvalues are distinct. All maximal tori are conjugate to the group 
of diagonal matrices in SL„, and the Weyl group is isomorphic to the permu- 
tation group Sn, which has order cardinality n\. We recommend to readers 
who are unfamiliar with the general theory of algebraic groups that they use 
this model case G = SL„ as a running example. 

Remark. The above classification of almost simple algebraic groups over 
an algebraically closed field shows that up to isomorphism there are only 
finitely many such groups in each given dimension. In particular there al- 
ways exists a model of G whose complexity is bounded in terms of dim(G) 
only. The same applies to semisimple algebraic groups (see [M])) they are 
isomorphic to a direct product of simple algebraic groups modulo a finite 
central kernel (whose size is bounded in terms of dim(G) only). In character- 
istic zero, the situation is even better: there are only finitely many conjugacy 
classes of almost simple algebraic subgroups of GL^^, and their complexity 
is thus Od(l) (this follows easily from complete reducibility and the classi- 
fication of irreducible linear representations of simple algebraic groups via 
their highest weight). However things seem more delicate in positive char- 
acteristic. 

With these preliminaries out of the way, we now come to a key definition. 

Definition 5.2 (Involved torus). If A is a sym,m,etric subset ofG{k), andT 
is a m^axim^al torus of G, we say that T is involved with A if A'^CiT contains 
a regular semisimple element. We let S^ = 3^ {A) denote the collection of 
all involved tori. 

The following lemma is fundamental to our work. It was inspired by 
the proofs of the sum-product phenomenon by Bourgain, Glibichuk, and 
Konyagin |9j and its interpretation given by Helfgott in [33] in terms of 
group actions. It has also been discovered independently by Pyber and 
Szabo [50]. We thank Harald Helfgott for pointing out a simplification to 
the proof of this lemma which, in the form below, is closer to the argument 
of Pyber and Szabo than it is to our original one. Since neither argument 
is particularly long, we give the slightly simpler version here, referring the 
reader to our announcement |15j for the original. 

Lemma 5.3 (Crucial lemma). Let M, K ^ 1, and let G be an almost simple 
algebraic group with complexity at most M over an algebraically closed field 
k. Let A be a K -approximate subgroup of G. Then at least one of the 
following statements holds. 

(i) {A is not sufficiently Zariski dense) A is contained in an algebraic 
subgroup of G of complexity Om(1) oind dimension strictly less than 
G. 

(ii) {A is small) \A\ <m iT'^^^^). 

(iii) The set ^ of involved tori T has the cardinality bounds 

, dim(T) -I dim(r) 

_^-OM(l)[^|i dhn{G) ^j^^ 1^1 ^^^ /^OM(l)[^|i dim{G) (5.I) 
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and is invariant under conjugation by A (and hence by the group 
(A) generated by A) . In other words, if T is involved, then so is 
a^^Ta for any a G A. 

Proof. We allow all implied constants to depend on M. 

Let us first show the cardinality bounds (|5.ip . By Lemma 15.11 and Corol- 
lary H^Sl either (i) holds, or the number of elements of A^ which are not reg- 
ular semisimple is 0(ir'^^ |A| dim{G) ^ (note that the first option in Corol- 
lary |3]3] cannot occur because G is almost simple.) Thus, either (i) or (ii) 
holds, or the number of regular semisimple elements of A^ is comparable 
to l^^l = i^'^^^^l^l. By Lemma |5. 11 these regular semisimple elements can 
be partitioned into involved tori, and by Lemma 14. 4^ either (i) holds, or 
each involved torus T contains a number of elements of A^ comparable to 
^0(i)|^|dim(T)/dim(G)^ gy Lemma O and Corollary [331 either (i) or (ii) 
holds, or the number of elements in ^^ n T that are not regular semisimple 
is at most /i:0(i)|A|(<i™(^)-i)/'ii"^(<^). Thus, either (i) or (ii) holds, or every 
involved torus T contains a number of regular semisimple elements in A^ 

I dim(T) 

comparable to |^| dim(G) _ xhe claim (j5.ip follows. 

Now, let T be an involved torus, let a & A, and let T := a~^Ta be the 
conjugate of T by a. Being involved, T is the connected component of the 
centraliser Z{b) of some regular semisimple b £ A^. Applying Lemma |4.4| 
we either have (i), or 

\A^ n Z{b)\ > K-0(l)|^|dim(T)/dim(G)^ 

Suppose we are in the latter case. Since \Z{b)/T\ ^ \N{T)/T\ = 0(1) one 
coset of T in Z{b) must have a large intersection with A^, in particular 

I A"^ n r| » i^-O(l) |^|dim(T)/ dim(G) _ 

Conjugating by o, we conclude 

I A^ n f I » i^-'^(l) \AfMT)/ dim(G) _ 

On the other hand, we have 

\A-{A^nf)\ ^ |A^| <.K^^^^\A\. 

Thus, by the pigeonhole principle, we can find an element g oi A ■ (A^ n T) 
which has » i^-C>(i)|^|'ii™(^)/dim(G) representations of the form g = at, 

where a £ A and t € A^ H T. But if at = a't' for some a, a' G A and 
t, t' eA^n T, then {a'y^a = t't'^ £ A^ nf. Thus we have 

I A^ n T| » i^-<^(l) \AfMT)/ dim(G) _ 

On the other hand, by Lemma FS.ll and Corollary l4.31 the number of elements 
of ^^ n T which are not regular semisimple is at most 

0{K'^^^^ |^|(dim(T)-l)/ dim(G)^_ 

Thus we are either in case (ii) or case (iii), and the claim follows. D 
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To proceed further, we need to divide into two cases depending on whether 
the group {A) generated by A is finite or infinite. In the infinite case, we 
use the following^ simple lemma. 

Lemma 5.4. Let M ^ 1, and let G be an almost simple algebraic group 
of complexity at most M over an algebraically closed field k. Let H be an 
infinite subgroup of G{k), which can be covered by finitely many left cosets 
aT of a torus T of complexity at most M inside G. Then there exists a 
torus S {not necessarily maximal) of complexity Om{^) such that H D S is 
infinite, and H is contained in the normaliser N{S) of S. 

Proof. It is clear from the assumption that H CiT is clearly infinite, so if 
H is contained in N{T) then we are done. Suppose, then, that there exists 
h (z H such that T := h~^Th ^ T. Conjugating H by h, we see that H can 
be covered by finitely many left cosets of T. The intersection of a left-coset 
of T and a left-coset of T is either empty or a left-coset of the subgroup 
TnT, which has complexity Om(1) and has strictly smaller dimension. On 
the other hand. Lemma lA.41 guarantees that the connected component S of 
TnT has complexity Oa/ (!)• We thus see that H can be covered by finitely 
many left cosets of the torus S. As we cannot have an infinite descent of 
tori of decreasing dimension, the claim follows by iterating this argument. 
D 

We may now state and prove a result that is, essentially, a precise form of 
our main theorem. Sometimes (for example for applications to expanders) 
this precise form is more helpful than Theorem 12.31 itself. We give the 
deduction of Theorem 12.31 afterwards. 



Theorem 5.5. Let M,K ^ 1, and let G be an almost simple algebraic 
group of complexity at most M over an algebraically closed field k, and let 
A be a K -approximate subgroup ofG{k). Then at least one of the following 
statements hold. 

(i) [A is not sufficiently Zariski dense) A is contained in an algebraic 
subgroup of G of complexity Oa//(1) and dimension strictly less than 
G. 
(ii) {A is small) \A\ <a/ K'^mW ^ 

(iii) {A controlled by (A)) The group (A) generated by A is finite, and 
has cardinality \{A)\ <a/ if<^A/(i)|^j_ 

Proof. We allow all constants to depend on M. From Lemma [5.1l (i) we see 
in particular that G has complexity Oa/? (1). There are two cases, depending 
on whether {A) is finite or not. 

First suppose that (A) is finite. From Lemma 15.3^ either (i) or (ii) holds, 
or the set ^ of involved tori is invariant under conjugation by (^4). On the 
other hand, {A) is certainly a 1-approximate subgroup of G = G(A:), so by 



7 

'In the case k = C, one could also use Jordan's lemma here. 
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Corollary 14.31 either (i) holds, or the intersection of {A) with any maximal 
torus has cardinality < |(^)|dim(r)/dim(G)_ g-^^^g |iV(r)/r| < 1 by Lemma 
15.11 we conclude that the stabiliser of the action of (A) on any torus in ^ 
has cardinality <^ |(J^^|dim(-r)/dim(G)_ gy ^j^g orbit-stabiliser theorem, this 
implies that 

1 dim(T) 
1^1 » |(yl)| dim(G). 

Comparing this with ()5.ip we obtain (iii). 

Now suppose instead that (A) is infinite. The orbit-stabiliser theorem 
and Lemma 15.31 then implies that either (i) or (ii) holds, or a finite index 
subgroup of {A) lies in N{T) for some T £ JT. In the latter case, since 
N{T)/T is finite, this implies that {A) is covered by a finite number of 
cosets of a torus T. By Lemma 15. 4^ we thus see that {A) is contained in 
the normaliser N{S) of a torus S of dimension at least one and complexity 
Om(1)- But such normalisers have complexity Om(1) by Lemma [3. 10^ and 
dimension strictly less than dim(G) by Lemma |5.H so we obtain (i) in this 
case. n 

We can now prove Theorem 12.31 whose statement we first recall. 

Theorem 12.31 (Main theorem). Let k he a finite field and let G he an 
ahsolutely almost simple algebraic group defined over k, and suppose that 
A C G{k) is a K -approximate group that generates G{k). Then A is 
K '^'^'^'^^'i -controlled by either {id} or by G{k) itself. 

Proof. Let k be the algebraic closure of k. The algebraic group G is a 
A;-form of the almost simple algebraic group G{k). Such /c-forms over finite 
fields have been entirely classified by Steinberg [55] and Tits [60l |61] . In 
particular, for every type of Dynkin diagram there is a bounded number 
(independently of A;) of such forms. As the statement of the theorem is 
invariant with respect to group isomorphism, we may thus assume that the 
complexity of G is bounded in terms of dim(G) only. 

Applying Theorem 15.51 it suffices to establish that option (i) of that the- 
orem cannot hold for \k\ ^ C'dim(G)- In other words, we must establish 
that G(A:) is "sufficiently-Zariski-dense" in G(A;) in the sense that the low- 
est complexity M^ of any proper subvariety of G{k) containing G(A;) tends 
to infinity with |A;|. Suppose, then, that V C G{k) is a proper subvariety 
of complexity M (not necessarily defined over k). The required statement 
then follows by combining the following two facts: 

• V{k) = Vn G{k) has cardinality at most OM(|fc|'^™^); 

• (Lang-Weil, i41j) The cardinality of G{k) is >m |A;|'i'™^. 

In connection with the first fact, a well-known variant of the Schwarz-Zippel 
lemma (see [25\ Lemma A. 3] for a simple proof) asserts that an irreducible 
affine variety V defined by m equations of degrees ^ d and coefficients in k 
has at most d'"(|A;|-|-l)'^'™^ points over k. In connection with the second, we 
note that in all cases (both split or non-split, see e.g. [H]) an exact formula 
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for |G(Fg)| in terms of q is known, and so the somewhat deep Lang- Weil 
estimate is really overkill here. For example, it is extremely well-known and 
easy to see that 

I sL^(jp )| ^ (g" -l)(g"-g)(g" -?')••• (g"-g"-^) ^ 
This concludes the proof of Theorem 12.31 D 

6. Approximate subgroups of GL„(C) 

The objective of this section is to establish Theorem 12.51 characterising 
approximate subgroups of linear groups in characteristic zero. As remarked 
after the statement of that theorem, we may assume without loss of general- 
ity that the ambient field is the complex numbers, and so we may rephrase 
the statement there with the following one. 

Theorem 12. 5L Let K,n ^ 1, and suppose that A C GL,„(C) is a K- 
approximate subgroup. Then A is 0{K"^^') -controlled by B, a CK "- 
approximate subgroup o/GL„(C) that generates a nilpotent group of step at 
most n — 1. 

There is an appetising strategy for establishing this result, and to outline 
it we recall some basic definitions and facts from algebraic group theory. In 
particular let us recall that if G is an algebraic group over some algebraically 
closed field k then the radical Rad(G) is the maximal normal connected 
solvable algebraic subgroup of G. An algebraic group with trivial radical is 
said to be semisimple; the quotient G/Rad(G) has this property. 

Proposition 6.1 (Semisimple algebraic groups over C). Suppose that G is 
a connected semisimple linear algebraic group over C. Then G is an almost 
direct product of almost simple algebraic groups Gi, . . . ,Gk, each arising 
from one of the irreducible root systems mentioned earlier. More precisely, 
there is a map vr : Gi x • • • x Gk — >• G with \ ker 7r| = Odim(G)(l)- Note also 
that (as a consequence of the classification of irreducible root systems) each 
Gi may be realised as a linear algebraic group of complexity Odim{G)(l)- 

Proof. See [4, Proposition 14.10]. Note that this proposition in fact es- 
tablishes some additional facts, for instance one can take the Gi to be the 
minimal nontrivial (smooth) connected normal (closed) /c-subgroups of G 
that pairwise commute, and one can also take the kernel ker vr to be central. 
We will not need this additional structure here. D 

As a consequence of this classification and of Theorem 15.51 it is not hard 
to prove the following result. 

Proposition 6.2. Suppose that G is a connected semisimple linear algebraic 
group over C and that A (1 G is a K -approximate group generating a Zariski- 
dense subgroup ofG. Then \A\ = 0(i^<^dim(G)(i)). 
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Proof. Decompose G as an almost direct product 7r{Gi x ••• x Gfc) of al- 
most simple linear algebraic groups over C as in Proposition 16. 11 the kernel 
of vr having size t = On(l)- An easy exercise confirms that the pullback 
B := Tr~^{A) is a (i^t)-approximate subgroup of Gi x • • • x Gm which gener- 
ates a Zariski-dense subgroup. Each of the projections TTi{B) to the factors 
Gi is then a (Kt)-approximate subgroup of Gi generating a Zariski-dense 
subgroup of Gi. The complexities of the groups Gi are bounded uniformly 
by Odim(G)(l)i &iid so we may apply Theorem 15.51 Clearly neither option (i) 
nor option (iii) of that theorem can occur, so we are forced to conclude that 
(ii) holds, or in other words that |7rj(i?)| = 0(K*^dim(G)(i)), This immediately 
implies the stated bound on 1^41 . D 

One is now tempted to argue as follows. Suppose that A is an approximate 
subgroup of GL„(C). Let G be the Zariski closure of {A), and consider the 
image of A under the projection to G/Rad(G). By the above remarks this 
is small and so a large piece of A is contained in a coset of the solvable group 
Rad(G). It is then a relatively easy matter to apply the main result of [H] to 
show that this piece in turn is controlled by a nilpotent approximate group. 

The only problem with this argument is that Proposition 16. II applies only 
to connected semisimple groups. Although a linear algebraic group has 
only finite many connected components, there is no absolute bound on their 
number in terms of the underlying dimension n. It is nonetheless possible 
to make this sketch of the argument work, but to do so we were forced to 
introduce a notion of "bounded complexity Zariski-closure" so as to con- 
trol the number of connected components of G/Rad(G). The details were 
somewhat complicated. However it turns out that by resting a little more 
heavily on known facts from the theory of linear algebraic groups we can 
use a somewhat different argument and thereby sidestep this issue. Critical 
in this regard is the following result, which makes up for the fact that we 
cannot bound the number of connected components of a virtually solvable 
group, by bounding instead the number of solvable components. 

Lemma 6.3 (Number of solvable components). Any virtually solvable sub- 
group of GLn{C) {that is to say, any subgroup o/GL„(C) with a solvable 
subgroup of bounded index) contains a normal solvable subgroup of index 

On{l). 

Proof. In fact, any such subgroup contains a subgroup of index 0„(1) which 
can be conjugated inside the group Upp„(C) of upper-triangular matrices. 
This lemma is "well-known" and follows from results of Malcev [63], §3.6] 
and Platonov [63', §10.11]. We offer a self-contained sketch in Appendix [Bl 
D 

We will also require the following bound on the outer automorphism group 
of a semisimple group. 
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Lemma 6.4 (Bound on Out{H)). Suppose that H is a connected semisimple 
complex algebraic group. Then the group Out{H) := Aut{H) / Inn(H) has 
size OdimH(l)- 

Proof. See 01 IV.14.9]. D 

We turn now to the details of the proof of Theorem 12.51 Henceforth, 
ah impHed constants to depend on n. Let T := (A) be the subgroup of 
GL„(C) generated by A, and let G be the Zariski closure of T. Let Go be 
the connected component of the identity of G, thus Gq is a connected normal 
subgroup of G of finite (but potentially large) index. Let H := Gq/ Rad(Go) 
be the quotient of Gq by its radical; H is then a connected semisimple 
algebraic group whose structure is as described in Proposition 16.11 We 
distinguish two cases, depending on whether H is trivial or not. 

Case 1: H is trivial. This implies that F is virtually solvable. Li this case, 
we can apply Lemma 16.31 and infer that T has a normal solvable (indeed 
upper triangular) subgroup Tq of index 0(1). By the pigeonhole principle 
some coset of Fq contains at least |^|/C points of A, where G = 0(1), and 
thus 1^^ n FqI ^ ^|A|. It follows from basic multiplicative combinatorics 
[56] , or more specifically [HI Proposition 2.1 (iv)], that B := {A^ n Fq)^ is 
a /iT'^'^^-approximate group that i('*^(^)-controls A"^ H Fq. Furthermore, A 
is itself m-controlled by A^ n Fq, where m = 0(1) is the number of cosets 
of Fq containing an element of A. It follows that i? is a solvable K'^^^'- 
approximate subgroup of GL„(C) which i^^^^-controls A. Applying the 
main result of [14J, Theorem 12.51 follows immediately. 

Case 2: H is non-trivial. Write R := Rad(Go) for brevity. Now i? is a 
characteristic subgroup of Gq, since the image of R under any automorphism 
of Gq is also a connected normal algebraic solvable subgroup of Gq. Noting 
that G normalises Gq, it follows that G normalises i?, and thus G also 
acts by conjugation on the quotient group H = Gq/R. This conjugation 
action induces a homomorphism vr : G — )• Aut(-ff). Note that ker(7r)i?/i? = 
Zqiji{H) is finite, since its intersection with the finite index subgroup Gq 
of G lies in the center Z{H) of H, which is finite as H is semisimple. In 
particular, kervr is virtually solvable. By essentially the same argument we 
used in Case 1, it suffices to show that some coset of ker tt contains at least 
X~'^(^)|^| points of A. To achieve this, it is obviously sufficient to show 
that \'k{A)\ ^K^W, 

To do this, observe that 7r(G) certainly contains the inner automorphisms 
Inn(if) = 7r(Go). Recall also Lemma 16. 4^ which asserts that the index 
[Aut(i7) : lnn{H)] is at most 0(1). Setting Fq = F n 7r-^(Inn(iJ)), we 
conclude that Fq is a subgroup of index d = 0(1) in F such that 7r(Fo) is 
Zariski dense in lnn{H). Now lnn{H) is a (connected) semisimple algebraic 
group, and by Lemma IC. II the set B := ^2d-i c^Yq generates Fq. Moreover, 
by the same multiplicative combinatorics results mentioned above, B^ is an 
0(iir^^^ )-approximate subgroup which 0(i^'^^ )-controls A, and therefore 
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tt{B^) is a Zariski-dense 0(-ftr'' '^^)-approxiniate subgroup oiInn{H) which 
0{K'-'^^>)-contToh tt{A). It foUows immediately from Proposition 16.21 that 
\tt{B'^)\ = 0(i^<^(i)), and hence that \7r{A)\ = 0{K'^^^^) as required. D 

We turn now to the deduction of Corollary 12. 6| whose statement we recall 
below. Recall that a finitely generated group G is said to have polynomial 
growth (with exponent d) if it is generated by a symmetric set S which 
satisfies some bound of the form 

|5n ^ Csr'^ (6.1) 

for all r € N, where Cs is a constant which may depend on S". It is an easy 
exercise to show that this a group property, that is to say it does not depend 
on the choice of generators. Recall also that G is virtually nilpotent if it has 
a nilpotent subgroup of finite index. 

Corollary 12.61 Suppose that k is a field of characteristic zero and that 
G ^ GL„(A;) is a {finitely -generated) subgroup of G with polynomial growth. 
Then G is virtually nilpotent. 

Proof. Let S be a symmetric generating set containing id. There are clearly 
arbitrarily large r for which 

i5^n ^ s'^i^n, (6-2) 

since if not the polynomial growth hypothesis would be violated. Call these 
values good, and suppose in what follows that r is good. By [56« Corollary 
3.10] the set A := S^'" is a X-approximate group for some K = O(l)'^. By 
Theorem 12.51 there is some nilpotent group H ^ GL„(fc) and a coset Hx 
such that \A n Hx\ ^ c„^d|^|, where Cn^ > depends only on n and d. We 
therefore have 

\S^'- nH\ = \A^ nH\^ Cn,d\A\ ^ Cn,d\S'\. (6.3) 

Replacing H by the subgroup generated hy A^ n H (if necessary) we may 
assume without loss of generality that H ^ G. Assume that [G : H] = oo. 
Then, since S generates G, it is easy to see that S'^ meets at least m different 
right cosets of H, for every integer m ^ 1. It follows from this observation 
and dnSD that 

Choosing m > 8'^/cn^d ^-nd some good value of r with r > m, we obtain a 
contradiction to (j6.2p . Thus we were wrong to assume that [G : H] = oo, 
and this concludes the proof. D 

The above argument does not give effective bounds on [G : H] (in terms 
of n, Cs and d) on account of the ineffectivity in Theorem 12.51 To conclude 
this section we offer now a very brief sketch of how an explicit bound could 
be obtained without effectivising our main theorem. First, note that an 
immediate consequence of Corollary 12.61 and Lemma 16.31 is that any group 
G ^ GL„(/c) with polynomial growth has a solvable subgroup H with index 
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[G : H\ = On(l); where this 0„(1) can be taken to be some exphcit function 
of the form exp(n ) by working through the proof of Lemma FG-Sl in Appendix 
IBI By Lemma [C. 11 H is finitely-generated and has polynomial growth (with 
effectively computable constant and growth exponent). Finally, one may run 
the argument used to prove Corollarv 12.61 again, but now with H in place of 
G, and with the main result of [H] (which is completely effective) in place 
of Theorem 12.51 We leave the details to the interested reader. 

Finally let us remark that an inspection of the argument used to prove 
Corollary 12.61 shows that we do not need the full strength of the polynomial 
growth hypothesis (j6.ip : indeed we only need this hypothesis for a single 
(sufficiently large) value of r. This type of strengthened version of Gromov's 
theorem first appeared in |57j in the solvable case and [53] for arbitrary 
finitely generated groups. For non-virtually solvable linear groups it is also 
a immediate consequence of the uniform Tits alternative proved in [12] . 

7. A CONJECTURE OF BABAI AND SeRESS 

Some applications of Helfgott's product theorem regarding the diameter 
of the Cayley graphs of SL2(Fp), or equivalently the speed of generation of 
SL2(Fp), are mentioned in Helfgott's original paper [32]. Not surprisingly, 
our result provides similar corollaries. For example, we get a special case of 
a conjecture of Babai and Seress ([Sj Conjecture 1.7]). 

Theorem 7.1 (Diameter of G{k)). For every integer d G N there is a 
constant C = Cd > such that the following holds. If G is an absolutely 
almost simple algebraic group with dim(G) ^ d defined over a finite field k 
and if S is a set of generators for G := G{k), then every element of G is a 
word of length at most Clog |G| in the elements of S. 

Remark. In this section, when we refer to a word of length at most L in 



£i^e2 ^£l 



«2 ■ ■ ■ «L ■ 



elements xi, . . . , x„, we refer to some expression of the form x^^ 
where each £j lies in the set {—1,0, 1}. 

Proof. Let 5 := (i/10dim(G). Using Corollary 12.41 we examine the powers 
5^5^52^.... Unless 

|53"|^|G|i-^ (7.1) 

that result implies that we have the growth bound 

where e = e^ > 0. It follows that if (j7.ip does not hold then 

|S3"|^2(i+^)", 

and so 

AT — 3" <^ Cdhg^'" \G\. 

We have shown that words of length A^ ^ C^log "* |G| in the generating set 
S cover aseti?(=S'^)ofsize|G|^~. Forsuch large subsets of G , arguments 
of Gowers [29] may be used to complete the proof. Gowers proved that if 
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H is a. group in which the smallest dimension of a non-trivial representation 
(over C) is dmin then A^ = H whenever |^| > \G\/d]l^^. See also [21 Ej. To 
use Gowers' result, we note that by a theorem of Landazuri and Seitz [JO] 
the smallest dimension of a non-trival projective complex representation is 

at least Cr-ji:/'!'*'™*'^' ^ Cr\H\^^^ for every Chevalley groupB H = G(A:)+ of 
rank r ^ d over k. It is well known that G{k) contains the Chevalley group 
G(A:)+ as a subgroup of index Or(l) (see e.g. [IS] 3.6(v)). We can thus apply 
the result to G{k)^ instead since \B'^ n G{k)~^\ ^^ |G|"^~ and conclude by 
Lemma [UH that S^-^^'^B^ = G. D 

Babai and Seress actually conjectured that the constant C^ and the im- 
plied multiplicative constant in the above theorem can be taken to be in- 
dependent of the dimension, and that this bound should also hold for the 
alternating groups. Furthermore it is likely that the above theorem is true 
with the smallest possible exponent Cd, that is C^ = 1. See |llj for soine 
evidence supporting this belief, which is related to the so-called Lubotzky- 
Weiss independence problem for expanders [l3]. Below we show that one 
can take C^ = 1 for a random, choice of generating set S. 

Theorem 7.2 (Logarithmic diameter for random generators). For every 
integer d E N there is a constant Cd > such that the following holds. If 
G = G{k) is the group of k-points of an absolutely almost simple algebraic 
group G of dimension at most d over a finite field k, and if two elements 
a, b are chosen at random from G, then with probability 1 — o\q\^^{\) every 
element g G G is a word of length at most Cd log |G| in a, b, a~^ and b~^ . 

We remark that the special case G = SL2(Fp), p prime, of this theorem 
was established by Helfgott |32j . 

To prove Theorem 17.21 we will need the following result from |27j . 

Theorem 7.3. |27) For every integer d G N there is a constant Cd > such 
that the following holds. If G is an absolutely almost simple algebraic group 
of dimension at most d over a finite field k, and if two elements a, b are 
chosen at random from G := G{k), then with probability 1 — o\q\_^^{\), no 
nontrivial word w of length ^ Qlog |G| in the free group F2 evaluates to the 
identity on a, b. 

Remarks. Equivalently, the corresponding Cayley graph of G on generat- 
ing set {a,b,a^^ ,b^^} has girth at least C(ilog|G|. The constant Cd can be 
taken to be any number smaller than (dim(G) log 3)^^. 



Proof of Theorem \ 7. S\ Set A'^i := Qlog |G|. From Theorem 17.31 it follows 



that if o, 6 € G are selected at random then, with probability 1 — o\q\^^{1), 
we have \S'^^\ ^ 3^''^°s|G| =; |G'|'?^ gay. Applying Corollary [23] 0^(1) times. 



Landazuri and Seitz state their result for groups Gad(Fq)+ witii Gad of adjoint type, 
but as they point out the general case follows immediately by Schur's lemma because 

G(F,)+/Z(G(F,)+) = Gad(F,)+. 



APPROXIMATE SUBGROUPS OF LINEAR GROUPS 33 

we see that \S'^''\ ^ |G|^"^ for some N2 <r Ni, where 6 := d/10dim(G) 
is the same quantity as in the proof of Theorem 17.11 Applying the result 
of Gowers exactly as before we obtain S"^ ^ = q^ thereby confirming the 
result. n 

Other results in a similar vein are available. For example if a and b 
are fixed elements of SL„(Z) generating a Zariski-dense subgroup then we 
may look at these a and b when reduced to lie in SLji(Fp). By the strong- 
approximation result of Matthews- Weisfeiler-Vasserstein |44j a and b gener- 
ate SL„(Fp) for p sufficiently large, and applying the Tits alternative [SS] 
one may show that there are ^ p^ distinct words w{a,b) of length clogp 
(here, rj and c may depend on a and b as well as on n). Proceeding exactly 
as above, one may then confirm that every element of SL„(Fp) is a word of 
length at most Clogp in these generators a,b. We leave the details of this 
verification to the reader. 

8. The sum-product theorem over Fp 

The aim of this section is to show how the sum-product theorem over 
Fp follows from our main results. We begin by recalling the statement of 
the sum-product theorem, which was first established in p!0] for "reasonably 
large" sets and then in [9j in full generality. 

Theorem 12.71 (Sum-product theorem over Fp). Let p be a prime, and sup- 
pose that A is a finite subset o/Fp such that \A ■ A\,\A-\- A\ ^ -ft^j^l- Then 
either \A\ ^ K^ or \A\ ^ K'^p. 

We will need the so-called "Katz-Tao lemma" jlO^ [38] . 

Lemma 8.1 (Katz-Tao Lemma). Let A be a finite subset of some field k 
and suppose that \A ■ A\,\A + A\ ^ ^|^|- Then there is a set A' (^ A with 
\A'\ ^ iir~'^|74| such that for any rational function ip : k^ — > A; U {00} we 
have the estimate \il}{A')\ ^ K^^^^'>\A'\, where 

ip{A') := {-0(01, . . . , am) : ai, . . . , flm G Al}. 

In proving Theorem 12.71 we may replace A by this set A'. Henceforth, 
then, we assume that 

\i^[A)\ i^ K°^^^^\A\ (8.1) 

for all rational functions tj: : F™ — > Fp U {00}. We may also assume that 
O^A. 

Consider the set X C SL2(Fp) defined by 

^■^\\a 1+^203 ) ■ ai,«2,Q3 £ A. 

If one imagines A to be an "approximate subfield" of Fp then X might be 
considered thought of as an appropriate definition of SL2 (A) . 
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It is clear from ([HTT]) that \X^\ ^ K^\X\. Applying Corollary El with 
G = SL2(Fp) it follows that one of the following occurs: 
(i) |X| ^ K^; 

(ii) |X|^i^-<=^|SL2(Fp)|,or 
(iii) X does not generate SL2(Fp). 

Obviously (i) implies that \A\ ^ K*-^ , whilst (ii) implies that \A\ ^ K^^ p. 
It remains, then, to rule out (iii). For this we use Dickson's classification 
of subgroups of SL2(Fp) (see I22j). In fact, we only need the following 
consequence, reported for instance in [6] Proposition 3]. 

Proposition 8.2. Suppose that H is a proper subgroup of SL2(Fp) with 
\H\ > 60. Then H is 2-step solvable, and hence 

[[hi,h2],[h3,h^]]=id (8.2) 

for all /ii , /i2 , /13 , /14 € H. 



Supposing that X generates such a group H, consider the condition (18. 2p 
with /ii, /i2) ^3; ^4 being arbitrary elements of X parametrised by twelve 
elements ai, . . . ,ai2 of A. Clearing denominators, this yields a polynomial 
(/) : Fp^ -^ ¥p of degree 0(1) which vanishes identically on A^'^. Since 
SL2(Fp) is not solvable, this polynomial cannot vanish identically. We may 
then apply the following well-known lemma, which is proved in [H Lemma 
2.1]. 

Lemma 8.3. Let F be any field and suppose that (p : F*" — )• F is a polyno- 
mial in variables xi,. . . ,Xm- Suppose that the degree degj 4> in each of the 
variables Xi is at most d, and that there is a set S* C F with 15*1 > d such 
that 4>{si, . . . , Sm) = whenever si, . . . ,Sm & S. Then cp is identically zero. 

It follows immediately from this that condition (iii) can only hold if |^| = 
0(1). This completes the proof of Theorem 12.71 D 

Appendix A. Quantitative algebraic geometry via ultrafilters 

The purpose of this appendix is to use ultrafilter methods to establish 
the quantitative algebraic geometry results in Lemmas 13.21 13.413.51 13.71 13.81 
13.101 13.111 and Lemma I3.12[ In principle, all of these arguments could be 
replaced by more involved effective analogues, but we do not do this here. 
Doing so would make all the constants used in the main theorem (Theorem 
12.31 computable in principle. 

The key point here is that many basic concepts in the theory of algebraic 
varieties (such as dimension, irreducibility, or degree) are continuous with 
respect to the operation of taking ultralimitqj. As a consequence of this and 
a compactness- type argument, many qualitative statements regarding such 
concepts automatically (but ineffectively) have quantitative analogues that 



This is analogous, but not quite identical to, the more well-known fact that such concepts 
are continuous with respect to operations such as projective limits; see [5]. 
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are uniform over all choice of (algebraically closed) base field and all choices 
of coefficients used to define the varieties at hand. 

We turn to the details. We will need a non-principal ultrafilter Uoo £ 
/3N\N, i.e. a collection of subsets of N with the following properties: 

• No finite set lies in Ooo- 

• If j4 C N is in a^o , then any subset of N containing A is in aoo . 

• If A,B lie in aoo, then An B also lies in aoo- 

• If j4 C N, then exactly one of A and N\A lies in aoo- 

An easy consequence of these axioms is that if AdB € Ooo then at least one 
of A and B lies in aoo- Given a property P{a) which may be true or false for 
each natural number a, we say that P is true for a sufficiently close to aoo 
if the set {a € N : P{a) holds} lies in Ooo- The existence of a non-principal 
ultrafilter aoo is guaranteed by the axiom of choice. We fix aoo throughout 
the rest of this appendix. We make the important observation that if Xa 
takes on only a finite number of values for a sufficiently close to aoo (e.g. if 
Xa ranges in a discrete space and is also bounded), then it is in fact constant 
for a sufficiently close to aoo - We write this constant as linia^aoo ^a ■ 

Given any sequence (Xq)q,(=n of sets, we define the ultraproduct Ha-i-a -^a 
to be the space of equivalence classes of tuples {xa)a&i with x^ € X^ for 
all a € N, where the equivalence relation is given by requiring {xa)a&n and 
{Vaja&i to be equivalent if Xa = Ha for all a sufficiently close to aoo- We call 
such an equivalence class the ultralimit of the sequence {xa)aeN and write 
it as liuia^aac, ^a- Note that the ultraproduct na->-a -^a and ultralimit 
linia^aoo ^a remain well defined even if Xa or Xa are only defined for a 
sufficiently close to aoo- 

We make the basic observation that two ultraproducts 

n Xa, n ^" 

agreqlj if and only if X^ = Y^ for all a sufficiently close to ao- 

Any operation or relation on a sets Xq., a G N carries over to the ul- 
traproduct, and a famous theorem of Los asserts that any statement in 
first-order logic that is true for all the Xa (or for Xa with a sufficiently 
close to aoo) is also true in the ultralimit. For instance, if ka is a sequence 
of algebraically closed fields, then the ultraproduct k := ria-^-o ^a is also 
an algebraically closed field, because the property of being an algebraically 



Strictly speaking, this statement is not precisely correct, because the equivalence rela- 
tions used to define the two ultraproducts are defined on distinct domains. But if one 
extends the equivalence relation to a common domain, such as ((Jq ^a U Va)'*, then the 
statement becomes valid. Alternatively, one can embed all the groups, fields, varieties, 
etc. one is interested in studying in a single standard universe U, which is assumed to 
be a set, and define the ultralimit equivalence relation on all sequences {xa)aea^ in U^ 
(or more generally, on sequences in U defined for a sufficiently close to Qoo)- We will not 
dwell on this foundational issue in the rest of this paper, as it makes no significant impact 
on the actual arguments here. 
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closed field can be expressed as a set of first-order sentences involving the 
field operations. 

Note that if fc := na->a ^a is an ultraproduct of algebraically closed 
fields, then we have some canonical identifications 

A"(A;) = Y[ A"(fc«) 

and 

P"(A;) = JJ P"(fc«) 

for fixed n. Furthermore, if V^ is a sequence of affine varieties in A"(A;q), 
with complexity bounded uniformly in a, we see that the ultraproduct 
V := Ha-^a ^ ^^ ^^ affine variety in A"(fc). This is basically because 
the ultralimit of polynomials of bounded degree remains polynomial. Con- 
versely, every affine variety in A"(/c) can be expressed as an ultralimit of 
affine varieties in A"(A;q-) of uniformly bounded complexity for a sufficiently 
close to Ooo- Similar claims of course hold for projective varieties, quasipro- 
jective varieties, and constructible sets. 

Now we investigate various continuity properties of the ultralimit of va- 
rieties. We begin with the continuity of dimension. 

Lemma A.l (Continuity of dimension). Let ka be a sequence of algebraic- 
ally closed fields, let n ^ 1, and let Va C A'^{ka) be a family of affine 
algebraic varieties of complexity uniformly bounded for a sufficiently close 
to aoo- Write k := H^^^^ka and V := H^^a^Va. Then dim(y) = 
lima->a^ dim(FQ,). In other words, we have dim(F) = dim(ya) for all a 
sufficiently close to Ooo- 

Similarly with affine varieties replaced by projective varieties, quasipro- 
jective varieties, or constructible sets. 

Proof. We begin with the claim for affine varieties. 

We induct on dimension n. The case n = is trivial, so suppose that 
n ^ 1 and the claim has already been shown for n — 1. Write d for the 
dimension oi V. If d = — 1, then V is empty and so Va must be empty for 
all a sufficiently close to Ooo, so suppose that d ^ 0. Since V has dimension 
d, we see from standard algebraic geometry theory that the slices 

Vt := {x e A'^-\ka) : (x,t) G F} 

all have dimension d — \ (or are all empty) for all but finitely many values 
ti, . . . ,tr of t G k, and the exceptional slices Vt- have dimension at most d. 
If the Vt for t ^ ti, . . . ,tr are all empty, then one of the exceptional slices 
Vt^ has to have dimension exactly d. As k is the ultraproduct of the ka, we 
can write ti = limc^Q,^ t^^i for each 1 ^ i ^ r. 

Suppose first that the Vt have dimension d — 1 for all t ^ ti, . . . ,tr. We 
claim that for a sufficiently close to Uoo, the slices (V^)t„ have dimension 
d — 1 whenever ta 7^ ta,i, • • • ,ta,r- Indeed, suppose that this were not the 
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case. Carefully negating the quantifiers (and using the ultrafilter property), 
we see that for a sufficiently close to aoo, we can find ta 7^ ia,i, • • • , to,r 
such that (V^)j^ has dimension different from d — \. Taking ultraproducts 
and writing t := lima^Q^ i^, we see from the induction hypothesis that Vt 
has dimension different from d — 1, contradiction. Thus (Ki)t has dimension 
d—\ whenever t ^ ta^i, ■ ■ ■ , ta,r and o is sufficiently close to Ooo (uniformly 
in t). A similar argument also shows that {Va)t has dimension at most d 
whenever t G {i«,i, . . . , i«,r} and a is sufficiently close to Ooo- Hence Va has 
dimension exactly d. 

A similar argument applies when Vt is empty for all t 7^ ti,. . . ,tr, and 
has dimension exactly equal to d for at least one of the ti, . . . ,tr, and at 
most d for the other slices. This establishes the claim for affine varieties. 

The projective case follows from the affine one by covering projective 
space by finitely many copies of affine space. 

The quasiprojective case follows from the projective one by expressing 
a quasiprojective variety V of dimension d as the projective variety of di- 
mension at most d, with a projective variety of dimension at most d — 1 
removed; this can be achieved by starting with the Zariski closure of V and 
decomposing into irreducible components. 

The constructible set case then follows from the quasiprojective one by 
expressing a constructible set of dimension d as the union of finitely many 
quasiprojective varieties of dimension at most d, with at least one of these 
varieties having dimension exactly d. D 



This already gives a quick proof of Lemma [ 

Proof of Lemma lS.Si Suppose this lemma failed. Carefully negating the 
quantifiers (and using the axiom of choice), we may find a dimension n, 
a sequence Va C A'^{ka) of dimension constructible sets and uniformly 
bounded complexity over algebraically closed fields ka, such that |Vq,| — )• cxd 
as a — >■ 00. We pass to an ultralimit to obtain a constructible set V := 
IIq^o ^' which by Lemma lA. II has dimension 0, and is thus finite. But 
then this forces Va to be finite for a sufficiently close to Ooo (indeed we have 
\Va\ = \V\ in such a neighbourhood), contradiction. D 

Now we study continuity of irreducibility. We will shortly establish the 
following result. 

Lemma A. 2 (Continuity of irreducibility). Suppose that Va C A'^{ka) are 
affine varieties of uniformly bounded complexity over algebraically closed 
fields ka, and letV := Ila^a ^ ^^ ^^^ ultraproduct. Then V is irreducible 
if and only if Va is irreducible for all a sufficiently close to aoo ■ Similarly 
for projective or quasiprojective varieties instead of affine varieties. 

This result is however a little bit more difficult to establish than Lemma 
lA.ll because it requires one to understand the relationship between the 
complexity of a variety and its degree. Recall that the degree of an affine 
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variety V C A"'(k) of dimension d is the cardinality of |y nTy|, where M^ is a 
generic affine n — d-dimensional subspace of A"(/i;) (i.e. for ah W in the afiine 
Grassmanian Grassn,n-dik) of afiine n — d-dimensional affine subspaces of 
A'"(fc), outside of a subvariety of Grassn,n-d(^) of strictly smaller dimension). 
It is a standard fact in algebraic geometry that the degree is well-defined as 
a natural number. We also have the following nontrivial fact. 

Theorem A. 3 (Degree controls complexity). Let V be an irreducible affine 
variety in A"'{k) of degree D over an algebraically closed field k. Then V has 
complexity at most Cn,D for some constants n,D depending only on n,D. 

Proof. I I It suffices to show that V can be cut out by polynomials of degree 

D, since the space of polynomials of degree D that vanish on ^ is a vector 
space of dimension bounded only by n and D. 

Let V have dimension d. We pick a generic affine subspace W of A;" of 
dimension n — d — 2, and consider the cone C{W, V) formed by taking the 
union of all the lines joining a point in VK to a point in V. This is an 
algebraic image oi W x V x k and is thus generically an algebraic set of 
dimension n — 1, i.e. a hypersurface. Furthermore, as V has degree D, it is 
not hard to see that C{W, V) has degree D as well. Since a hypersurface is 
necessarily cut out by a single polynomial, this polynomial must have degree 
D. 

To finish the claim, it suffices to show that the intersection of the C{W, V) 
as W varies is exactly V. Clearly, this intersection contains V. Now let p 
be any point not in V. The cone of V over p can be viewed as an algebraic 
subset of the projective space P"~^(A:) of dimension d; meanwhile, the cone 
of a generic subspace W of dimension n — d — 2 is a generic subspace of 
P"~^(/c) of the same dimension. Thus, for generic W, these two cones do 
not intersect, and thus p lies outside C{W, V), and the claim follows. D 

Remark. There is a stronger and more difficult theorem that asserts that 
if the degree of a scheme in /c" is bounded, then the complexity of that 
scheme is bounded as well; see [Ml Corollary 6.11]. We will not need this 
stronger statement here. The converse statement (that complexity controls 
degree) is also true, being a corollary of Lemma 13.51 

Proof of Lemma \A.S\ . We first establish this claim for affine varieties. 
The "if" direction of the lemma is the easiest. Suppose then that V is 
reducible. Then it is the proper union of affine varieties V^i, V2. Each Vi, V2 
can be expressed as the ultralimit of affine varieties Va^i, Va,2 of bounded 
complexity, and one easily sees that Va is the proper union of Va,i and Va^2 
for a sufficiently close to aoo- Thus Va is reducible for such a, and the claim 
follows. 

Now suppose that V is irreducible; our task is to show that the Va are 
irreducible for a sufficiently close to ao- 



We thank Jordan EUenberg and Ania Otwinowska for this argument, which goes back 

to He]. 
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Let d and D be the dimension and degree of V, thus |y n Ty| = D for 
generic W € Grass(/c). Undoing the ultrahniit using Lemma fA.H we see that 
for a sufficiently close to oq, \Va n Wa\ = D for generic Wa G Grass(A;Q). In 
other words, V^ has degree D. The same is clearly true of any irreducible 
subvariety of Ua ^ Va with dim.{Ua) = dim(Va). By Theorem lA.3l then, any 
such subvariety Uq, will have complexity bounded by Cn,D uniformly in a. 
For each a select some Ua, and let U be the ultraproduct these Ua- Then 
by Lemma lA.ll and the uniform complexity bound, f/ is a d-dimensional 
subvariety of V, and thus must equal all of V by the irreducibility of V. But 
this implies that Ua = Va for all a sufficiently close to ao, and the claim for 
affine varieties then follows. 

The claim for projective varieties then follows by covering projective space 
by a finite number of copies of affine space. The claim for quasiprojective 
varieties then follows by writing an irreducible quasiprojective variety as an 
irreducible projective variety (i.e. the Zariski closure) with some varieties of 
strictly smaller dimension removed. D 

This has the following consequence, used at several points in the paper. 

Lemma A. 4. Let V C A" (A;) be an affine variety of complexity at most M 
over an algebraically closed field k. Then V can be expressed as the union 
of at most Om{\) irreducible varieties of complexity at most Oa/ (1). 

Similarly with affine varieties replaced by projective or quasiprojective va- 
rieties. 

Proof. As n is bounded by M, it suffices to prove the claim for a fixed n. 
We shall just establish the claim for affine varieties, as the projective and 
quasiprojective cases are similar. 

Fix n and M, and suppose the claim failed. Carefully negating all the 
quantifiers (and using the axiom of choice), we see that there exists a se- 
quence Va C A"(A:q,) of affine varieties of uniformly bounded complexity, 
such that Va cannot be expressed as the union of a or fewer irreducible 
affine varieties of complexity at most a. Now we pass to an ultralimit, 
obtaining an affine variety V := linia^aca ^ '^ A"(/c). As V is an affine 
variety, standard algebraic geometry allows one to write V as the union of 
finitely many irreducible affine varieties Vi, . . . , Vm- Each of these varieties 
Vi is the ultraproduct of affine varieties Va^i C hJ^{ka) of bounded complex- 
ity; by Lemma Fa. 2 1 the Va,i will be irreducible for a sufficiently close to a^o- 
On the other hand, the Va are the union of the Va,i for a sufficiently close 
to Ooo- This contradicts the construction of the V^, and the claim follows. 
D 

As a consequence of the above machinery we have the following result 
asserting the continuity of Zariski closure and Zariski density. 

Lemma A. 5. Suppose that Va C P"(/cq,) are varieties of uniformly bounded 
complexity over algebraically closed fields ka, and let V := Yla^a ^a be 
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the ultraproduct. Then V = Ha-j-a ^a, where V and Va are the Zariski 
closures of V, Va respectively. 

Similarly, i/ V^ C Wa are varieties of uniformly bounded complexity over 
ka, and V := Yla-^a ^a and W := na^.Q ^«? then V is Zariski-dense in 
W if and only if there exists a finite M such that V^ is M -Zariski-dense in 
Wa for all a sufficiently close to Ooo- 

Proof. We just prove the first claim, as the second claim is similar. One 
can represent V as the union of Zariski-dense open subsets Vi of irreducible 
projective varieties Vi, with V then being the union of the Vi. One can 
then view Vi as the irreducible projective variety Vi with a subvariety of 
strictly smaller dimension removed. We can express the variety Vi as an 
ultraproduct Ha^a ^«a- Using Lemma [A. II and Lemma [A. 21 we thus see 
that for a sufficiently close to aoo, the Via are irreducible, and Va is the 
union of Zariski-dense subsets of Via- Thus Va = Ui ^a ^^ such a, and the 
claim follows. D 



Now we can quickly prove Lemma [ 

Proof of Lemma \3.SX Suppose this lemma failed. Carefully negating 
the quantifiers (and using the axiom of choice), we may find a dimension 
n, a sequence Va C P'^(fea) of constructible sets and uniformly bounded 
complexity over algebraically closed fields ka, such that the Zariski closure 
Va of Va has complexity at least a. We pass to an ultralimit to obtain a 
constructible set T^ := Ha-i^a ^- -^y Lemma [A. 51 the Zariski closure V is 
the ultraproduct of the Va, and hence the Va have bounded complexity, a 
contradiction. □ 

Now we apply the ultralimit machinery to regular maps. Given a collec- 
tion of maps 4)a '. Xa — )■ Ya for a sufficiently close to Ooo , we can construct 
the ultralimit cf) := Ivcua^aao 't'a^ defined as the map (p : X ^ Y from the 
ultraproduct X := Wa^a -^^ to the ultraproduct Y := Hq^q ^a by the 
formula 

(j){ lim Xa) '.= lim (j)a{Xa) 

for any sequence Xa € Xa- It is easy to see that this limit map is well 
defined. 

From Definition 13.31 we obtain the following basic lemma. 

Lemma A. 6 (Ultralimits of regular maps). For a sufficiently close to Ooo, 
let Va, Wa be varieties of uniformly bounded complexity over an algebraically 
closed field ka, and let (j)a '■ Va ^ Wa be a regular map of uniformly bounded 
complexity. Then the ultralimit cj) := liuia^aoo 4>a is a regular map from the 
ultraproduct V := Yla^a ^a {which is a variety over k := Yla^a ^a) ^'^ 
the ultraproduct W := !!«->■« ^a {which is also a variety over k). 

Conversely, if V := Jla-s-a ^a o,nd W := Ha-s-a ^a o,re algebraic va- 
rieties over k = Ha-^-a ^a, and (p : V ^ W is a regular map, then one can 
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write (j) = linia^aac 4>a, where for a sufficiently close to Uoo, (pa '■ Va ^- Wa 
is a regular map of complexity bounded uniformly in a. 

Now we can prove Lemma 13.41 

Proof of Lemma \3.4\ We begin with the first claim. If this claim failed, 
then there exists a two sequences of regular maps (pa '■ Va ^ Wa and 
ipa '■ Ua ^ Va of Uniformly bounded complexity (so in particular Ua,Va, Wa 
also have uniformly bounded complexity) , such that ipa ° (pa '■ Ua ^ Wa is 
not given by a regular map of complexity at most a. But then by Lemma 
IA.6I we may take ultralimits and ultraproducts to create two regular maps 
(f) : V ^ W and ^p : U ^ V in the obvious manner. From classical algebraic 
geometry we know that the composition tp o cf) : U ^ W is then regular, and 
so by another application of Lemma IA.6I we see that ipa° (pa '-Ua ^ Wa is 
a regular map of bounded complexity for a sufficiently close to Ooo , giving 
the desired contradiction. 

Now we prove the second claim, for images. If this claim failed, then we 
can find a sequence of regular maps 0q, : Fq, — > Wa of uniformly bounded 
complexity, such that (paiYa) is not a constructible set of complexity at 
most a. Taking ultralimits again, we obtain a regular map (p : V ^ W. 
From classical algebraic geometry we know that (piV) is a constructible set, 
which implies that for a sufficiently close to ao, (paiVa) is a constructible 
set of uniformly bounded complexity, giving the desired contradiction. The 
analogous claim for pre-images is proven similarly. D 

Now we can establish continuity of dominance under ultralimits. 

Lemma A. 7 (Continuity of dominance). Let (pa -Va ^ Wa he a sequence 
of regular maps of uniformly hounded complexity over an algebraically closed 
field ka- Let (p := limc^Q,^ (pa, V := Ha-^a^ ^a, W := Ha-^ao^ ^a, and 
k := !!«->.« ^a- Then (p : V ^ W is dominant if and only if (pa ■ Va — > Wa 
is dominant for all a sufficiently close to Ooo- 

Proof. Suppose first that (pa is dominant for all a sufficiently close to aoo- 
Then for a sufficiently close to aoo ^a is irreducible, hence V is irreducible 
by Lemma lA. 21 The Zariski closure of (paiVa) is Wa, and hence by Lemma 
lA.SI the Zariski closure of (piV) is W . Therefore (p is dominant. 

The converse claim follows by reversing all of these steps. D 

Now we prove Lemma FS.?! The key is to establish the following qualitative 
variant. 

Lemma A. 8 (Qualitative dimension lemma). Let V,W he varieties, and 
let (p : V ^ W be a regular map. Then there exists a Zariski open subset 
V' ofV, and a subvariety W' ofW of dimension at most dim(y), with the 
following two properties: 

• {Generic mapping) (p{V') C W' . 

• {Generic fibres) For any w G W' , the set {v ^V : (p{v) = w} is a 
constructible set of dimension at most dim(y) — dim(VF'). 



42 EMMANUEL BREUILLARD, BEN GREEN, AND TERENCE TAG 

If (p is dominant, then we may take W to be a Zariski-dense subset ofW. 

Proof. Let d be the dimension of V. We may view V as the union of 
Zariski-open subsets of irreducible projective varieties Vi of dimension at 
most d. We may remove ah varieties Vi of dimension strictly less than d by 
placing them in the exceptional set V\V'. We may then work with just a 
single Vi, as the general case follows by taking unions. Thus, V is now a 
Zariski-dense subvariety of an irreducible projective variety V. 

We may replace W by the Zariski closure of 4>{V)- When one does so, (j) 
becomes a dominant map; also we now have dim(Ty) ^ dim(y). But now, 
a standard result in classical algebraic geometry (see |521 §1.6.3]) shows that 
there exist Zariski-open subsets V',W' of V,W respectively, such that (j) 
restricts to a dominant map (p' : V ^ W, whose fibres {v £ V : (f>{v) = w} 
have dimension dim(y) — dim(PF). The claims then follow. D 



Proof of Lemma \3.7\ This is deduced from Lemma IA.8I by a very similar 



ultralimit argument to previous arguments, so we shall only give a sketch 
here. If the first claim failed, then one can find a sequence of regular maps 
(j)a '■ Va ^ Wa of Uniformly bounded complexity, such that one cannot find 
a a- Zariski open subset V^ of Va and a subvariety W^ of dimension at most 
dim(y) with complexity at most a such that (piV^) C W^ and for every 
w G W^, the set {v £ V^ : (l){v) = tf} is a constructible set of dimension 
at most dim(V^) — dim(TV^) and complexity at most a. One then takes 
ultralimits of the (pa^Va, Wa to create a regular map cp : V ^ W. Applying 
Lemma IA.8I and undoing the ultralimit (using Lemma lA.ll) we obtain the 
required contradiction. 

The argument when the second claim fails is similar, except that now the 
(pa are also dominant maps, and Wa is required to be a- Zariski dense in W. 
One then argues as before but also uses Lemma IA.7I and Lemma IA.5I D 

In a similar spirit. Lemma 13.81 follows by the usual ultralimit argument 
from the following qualitative lemma. 

Lemma A. 9 (Qualitative slicing lemma). Let V,W be varieties, and let S 
be a subvariety ofVxW of dimension strictly less than dim(y) + dim(l^). 
Then for generic v £ V , the set {w £ W : {v,w) £ S} is a constructible set 
of dimension strictly less than dim(iy). 

Proof. The projection map ir : S ^ V that maps {v, w) to v is a regular 
map. Let V be the effective image of this map, given by Lemma lA. 81 The 
claim then follows from that lemma (dividing into two cases, depending on 
whether the effective image V has dimension equal to that oi V, or has 
strictly lower dimension). D 

Finally, we discuss algebraic groups. Prom Definition 13.91 and Lemma lA. 61 
we have the following lemma. 
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Lemma A. 10 (Ultralimits of algebraic groups). For a sufficiently close to 
OoD, let Ga be an algebraic group of uniformly bounded complexity over an 
algebraically closed field ka- Then the ultraproduct G := ria-s-a ^a is an 
algebraic group also. Conversely, every algebraic group over an ultraproduct 
G = Ha-^-a ^a of algebraically closed fields is an ultraproduct of algebraic 
groups Ga of bounded complexity over k^ ■ 

It is clear that if G is an algebraic group, then the centralisers Z[a) 
and conjugacy classes a are constructible sets, and that the normaliser 
and centraliser of an algebraic group H is another algebraic group; and so 
Lemma 13.101 follows from Lemma lA. 101 and the usual ultralimit argument. 

Now we turn to Lemma 13.111 This will be deduced from the following 
qualitative fact. 

Lemma A. 11 (Zariski closure of groups). Let A <Z G be any subgroup of an 
algebraic group G. Then the Zariski closure A of A is an algebraic subgroup 
ofG. 

Proof. A contains the identity and is closed with respect to inverses, so 
A is also. For any a G ^4, the map g ^^ ag \s a. regular isomorphism that 
preserves A, and thus must also preserve A; thus aA = A. On the other 
hand, the set {5 € G : gA = A} is a variety (being the intersection of a 
family of closed subvarieties in G) that contains A, and so must contain A. 
Thus A is closed under multiplication, and the claim follows. D 

Proof of Lemma \3.11\ Suppose the claim failed. Then there exists an M 
and a sequence of algebraic groups Ga of complexity at most M, subvarieties 
Va of complexity at most M, and symmetric subsets Aa containing id in Ga 
with A'^ C Va, such that Aa is not contained in any algebraic subgroup of Ga 
contained in Va of complexity at most a. Let G, A, V be the ultraproducts of 
Ga,Aa,Va respectively. By construction, A contains the identity, is closed 
with respect to inverses, and A"* C V for any finite m. In particular, the 
group {A) generated by A is contained in V. By Lemma I A. IH the Zariski 
closure H := (A) of this group is an algebraic subgroup of G contained in 
V. Undoing the ultraproduct using Lemma lA. 101 we can express H as the 
ultraproduct of algebraic subgroups Ha of Ga which contain A and are of 
bounded complexity for a sufficiently close to Uoo- But this contradicts the 
construction of Aa- □ 

By using very similar ultralimit arguments to those already employed, we 
see that Lemma 13.121 follows from the next result, which might be called a 
qualitative product-conjugation phenomenon for subvarieties. 

Lemma A. 12. Let G be an algebraic group. Let V, W be algebraic varieties 
in G such that 

< dim(y),dim(VF) < dim(G). 

Then at least one of the following holds: 



44 EMMANUEL BREUILLARD, BEN GREEN, AND TERENCE TAG 

• {G is not almost simple) G contains a proper normal algebraic sub- 
group H of positive dimension. 

• (Escape) For generic a G G, there exists an essential product {V"" ■ 
Wy'^'^ of V"" := a~^Va and W of dimension strictly greater than 
dim{W). 

Proof. Suppose that G is almost simple, that is to say it contains no normal 
subgroups H of positive dimension. 

We can express V and W as unions of Zariski-open subsets of irreducible 
subvarieties of G. By restricting to just one such top-dimensional subvariety 
for both V and W, we may assume that the Zariski closures V,W oi V,W 
in G are irreducible. 

Suppose that V"" is such that (V^ ■ wy^^ has dimension less than or equal 
to dim(Ty). As W is irreducible, the translates v"'W for v"^ G V"" are also 
irreducible with dimension dim(VF). Suppose that the set S := {v'^W : v"" G 
y°} of such translates is infinite. Then any essential product (y^-WY^^ must 
contain a Zariski-dense subset of all but finitely many of such translates, 
and so has dimension strictly greater than dim(Vl^), contradiction. Thus 
S is finite. For each W' in S, the set {v"- G V"- : v°'W = W'} is easily 
seen i to be a constructible subset of V"", and the union is all of V^ . Thus 
there exists W' G S such that {u" G V"" : v^W = W'} is Zariski-dense 
in V , i.e. v'^W = W for generic v G V . In particular, this implies that 



-l„/^a- 



[y ^V )"'W = W for generic v,v' G V. 

The set S := {g € G : gW = W} is an intersection of closed varieties 
Ww^^, w gW \n G and is thus closed (by the Noetherian condition); it is 
clearly a group, and is thus an algebraic subgroup of G. Since each of the 
varieties Ww~^ has dimension dim(VF), S has dimension at most dim(PF). 

From the previous discussion we see that if {V^ ■ wy^^ has dimension 
less than or equal to dim(l/l/^), then (v~^v'y G S for generic v,v' G V. In 
particular S cannot have dimension 0. 

If the escape property fails, we conclude that (v^^v'y G S for generic 
v,v' G V and a € G. Call an a good if we have v^^v' G S"* for generic 
V, v' G V. Then a generic element of G is good. Define a good set to be an 
intersection of finitely many S"" , such that a is good. By construction, all 
good sets H are closed algebraic subgroups of G, and they have the property 
that v~^v' G H for generic v,v' G V. 

By the Noetherian condition, there exists a good set H which is minimal 
with respect to set inclusion. Observe that H"^ is also good for generic 
a, and hence by minimality H = H"^ for generic a. In other words, the 
normaliser N[H) is Zariski dense in G. But N{H) is also a closed variety, 
and so N{H) = G, thus H is normal in G. By the almost simplicity of 
G, this forces H to be zero-dimensional. But this contradicts the fact that 
v^^v' G H for generic v, v' G V, since dim(y) > 0, and the claim follows. D 



Here we use the Noetherian condition that there does not exist an infinite descending 
chain of closed varieties. 
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Appendix B. A lemma of Malcev and Platonov 

In this short appendix we sketch a brief proof of Lemma [6. 31 whose state- 
ment we recah now. 

Lemma 16.31 Any virtually solvable subgroup of GL„(C) {that is to say, 
any subgroup o/ GL„,(C) with a solvable subgroup of finite index) contains 
a normal subgroup of index 0„,(1) which is simultaneously triangularis able, 
hence solvable. 

Proof. Let G be the Zariski closure of T. By inducting on n we may 
assume that G, together with all of its subgroups of index at most n!, acts 
irreducibly on C". Let Go be the (solvable) connected component of the 
identity of G and let U be its unipotent radical. If U is non-trivial, the 
subspace V of C" consisting of all x for which ux = x for all u € ?7 must 
be non-trivial. Since G normalizes U we have g~^ugx = x for all g (z G, 
which implies that gV C V . This, of course, contradicts the irreducibility 
assumption. It follows that U is trivial and thus Go is a torus which, after 
a change of basis, is diagonal. Note that G is contained in the normalizer 
N{Gq) of Go inside GL„(C). The centralizer Z{Go) of Go in GL„(C) is easy 
to compute explicitly: it is a block diagonal subgroup. Moreover N{Gq) 
permutes these diagonal blocks, and hence |A^(Go)/.^(Go)| ^ n\. Passing to 
a subgroup of G of index at most n\ we may thus assume that G centralizes 
Gq. By assumption G acts irreducibly: this forces Go to be trivial as G 
fixes the weight spaces of Gq. It follows that Go is trivial and hence that 
G is finite. An appeal to Jordan's theorem, which states that G then has 
an abelian, hence simultaneously diagonalisable, subgroup of index 0„(1), 
concludes the argument. D 



Appendix C. Generating finite index subgroups 

The purpose of this appendix is to establish the following well-known 
lemma (which appears for instance as [53', Lemma 4.8] or as [121 Lemma 
6.7]), required towards the end of the proof of Theorem 12.51 

Lemma C.l (Bounded index subgroups have bounded length generators). 
Let T be a group, and S a finite symmetric generating set containing the 
identity. Let Tq a subgroup of index d. Then S'^'^^^ contains a generating 
set for Fq. 

Proof. By the pigeonhole principle there is some n ^ d — 1 such that 
|5"ro/ro| = \S"'+^To/ro\. For this n we have S^Fq = S^+^Fq. In particular, 
S"'Tq is invariant under left multiplication by S and is therefore the whole 
of F. It follows, of course, that S'^^-'^Fq = F. Let 7^ € S'^~^ iov i = 1, . . . ,d 
be a full set of coset representatives of Fq inside F with 71 G Fq. For each 
i = 1, . . . ,d and each s £ S let j = j{i, s) G {1, . . . ,d} be that index for 
which s7jFo = 7jFo. Then it is straightforward to verify that the elements 
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7~ S7i € FonS'^^^^ together with 71 form a generating set of Fq; just rewrite 
any word with letters in S in terms of those elements. D 



[1 
[2: 

[3; 
K 

[s; 

[6; 
[7; 
[s; 
[9; 

[lo; 

[11 

[12: 
[is; 

[14 

[is: 
[16; 

[17; 
[is; 

[19 

[20; 

[21 

[22; 

[23' 

[24^ 
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